Novel Serine Protease Genes Related to DPPIV

Field of the Invention 

The present invention relates to novel serine proteases related to dipeptidyl 
peptidase IV (DPPIV), and to isolated nucleic acids coding for these proteases, all of
which are useful for the discovery of new therapeutic agents, for measuring protease
activity, and for determining the inhibitory activity of compounds against these
proteases. 

Background of the Invention 

Proteases and peptidases are enzymes that catalyse the hydrolysis of peptidic 
amide bonds. Proteases play an important role in the regulation of biological processes
in almost every life-form, from viruses and bacteria to mammals. They perform critical
functions in, for example, digestion, blood clotting, apoptosis, activation of immune 
responses, zymogen activation, viral maturation, protein secretion and protein 
trafficking. They can be classified according to a number of criteria, such as site of 
action, substrate preference, and mechanism. So, for example, aminopeptidases act
preferentially at the N-terminal residues of a peptide, while carboxypeptidases act
preferentially at the C-terminus and endopeptidases act at sites removed from the two 
termini. Among the carboxy- and aminopeptidases, peptidyl peptidases cleave a single 
amino acid residue from the substrate, dipeptidyl peptidases cleave a dipeptide unit (two 
amino acids) from the substrate, and tripeptidases cleave three amino acids from the
substrate. Substrate preference is frequently expressed in terms of the amino acid
residue immediately N-terminal to the cleavage site. For example, trypsin-like 
peptidases will preferentially cleave a peptide next to a basic amino acid (arginine or 
lysine), i.e. where the bond hydrolysed is the Arg/Lys-Xaa bond. As another example, 
the chymotrypsin-like family of peptidases preferentially hydrolyse peptides adjacent to
an aromatic residue. Mechanistically, peptidases are classified as being
serine-dependent, cysteine-dependent, aspartic acid-dependent or zinc-dependent.

Because peptidases and proteases are involved in the regulation of many 
physiological processes, they are attractive targets for the development of therapeutic 
agents. Protease and peptidase inhibitors are, for example, used in the treatment of
hypertension, coagulation disorders, and viral infection.

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, 
being found in viruses, bacteria and eukaryotes. Over 20 families (denoted S1 - S27) of
serine protease have been identified; these are grouped into 6 clans (SA, SB, SC, SE, SF
and SG) on the basis of structural similarity and other functional evidence. Structures
are known for four of the clans (SA, SB, SC and SE); these appear to be totally
unrelated, suggesting at least four evolutionary origins of serine peptidases and possibly
many more (Rawlings and Barrett, Meth. Enzymol. 244: 19-61 (1994)).

The prolyl oligopeptidase family consists of a number of evolutionarily related
peptidases whose catalytic activity seems to be provided by a charge relay system similar
to that of the trypsin family of serine proteases, but which evolved by independent 
convergent evolution. A conserved serine residue has been shown experimentally (in E. 
coli protease II as well as in pig and bacterial PE) to be necessary for the catalytic
mechanism. This serine, which is part of the catalytic triad (Ser, His, Asp), is generally
located about 150 residues away from the C-terminal extremity of these enzymes (which
are all proteins that contain about 700 to 800 amino acids).

One of the most intensively studied prolyl oligopeptidases is dipeptidyl peptidase 
IV (DPPIV, EC 3.4.14.5), a type II glycoprotein, which is the only well characterised
dipeptidyl aminopeptidase known to be located on the outer side of plasma membranes.
As indicated above, dipeptidyl aminopeptidases are characterised by their ability to cleave 
N-terminal dipeptides from a variety of small peptides. Dipeptidyl aminopeptidases show 
different substrate specificities and cellular localisation, suggesting different functions of 
each activity in peptide processing. DPPIV is characterised by its capacity to cleave
N-terminal dipeptides containing proline or alanine as the penultimate residue. The DPPIV
gene spans approximately 70 kb and contains 26 exons, ranging in size from 45 bp to 1.4 
kb. The nucleotide sequence (3,465 bp) of the cDNA contains an open reading frame 
encoding a polypeptide comprising 766 amino acids. The nucleotides that encode the 
active site sequence (G-W-S-Y-G) are split between 2 exons. This clearly distinguishes
the genomic organisation of the prolyl oligopeptidase family from that of the classic serine
protease family. 

DPPIV is widely distributed in mammalian tissues and is found in great 
abundance in the kidney, intestinal epithelium and placenta (Yaron, A. and Naider, F., 
Critical Reviews in Biochem. Mol. Biol. 1993 [1], 31). In the human immune system, 
the enzyme is expressed almost exclusively by activated T-lymphocytes of the CD4+
type where the enzyme has been shown to be synonymous with the cell-surface antigen
CD26. Although the exact role of DPPIV in human physiology is still not completely
understood, recent research has shown that the enzyme clearly has a major role in human 
physiology and pathophysiology.

On human T cells, DPPIV expression appears late in thymic differentiation and is
preferentially restricted to the CD4+ helper/memory population, and CD26 can deliver a
potent co-stimulatory T-cell activation signal. DPPIV, also known as T-cell activation
antigen CD26, therefore plays an important role in the immune response via association
with CD45 tyrosine phosphatase and, through its ability to bind adenosine deaminase
(ADA) to the T-cell surface, protects the T-cell from adenosine-mediated inhibition of 
proliferation. Furthermore, the regulation of the function of chemokines by 
CD26/DPPIV appears to be essential for lymphocyte trafficking and infectivity of HIV 
strains. DPPIV has been associated with numerous functions including involvement in
T-cell activation, cell adhesion, digestion of proline-containing peptides in the kidney
and intestines, HIV infection and apoptosis, and regulation of tumorigenicity in certain
melanoma cells (Pethiyagoda et al., Clin. Exp. Metastasis 2000; 18(5):391-400). DPPIV
is also implicated in endocrine regulation and metabolic physiology. More
particularly, DPPIV cleaves the amino-terminal His-Ala dipeptide of GLP-1, generating
a GLP-1 receptor antagonist, and thereby shortens the physiological response to GLP-1.
Glucagon-like peptide-1 (GLP-1), an incretin that induces glucose-dependent insulin
secretion, is rapidly degraded by DPPIV, and since the half-life for DPPIV cleavage is
much shorter than the half-life for removal of GLP-1 from circulation, a significant
increase in GLP-1 bioactivity (5- to 10-fold) is anticipated from DPPIV inhibition.
Inhibitors of DPPIV are currently being studied in the clinic as potential therapeutic
agents for type 2 diabetes and impaired glucose tolerance. 
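
The substrate rule described above (removal of an N-terminal dipeptide when the penultimate residue is proline or alanine) can be illustrated with a minimal Python sketch. The function name `cleave_n_terminal_dipeptide` is hypothetical, and the one-letter GLP-1(7-36) sequence is supplied here as an assumption for illustration; it is not taken from the sequence listing. The sketch models only the positional rule, not enzyme kinetics.

```python
def cleave_n_terminal_dipeptide(peptide, allowed_penultimate=("P", "A")):
    """Apply the DPPIV-like rule to a one-letter peptide string.

    Returns (dipeptide, remainder) when the second residue is Pro or Ala,
    otherwise None. "Penultimate" follows the text's usage: the second
    residue counted from the N-terminus.
    """
    if len(peptide) < 3:
        return None  # nothing would remain after removing a dipeptide
    if peptide[1] not in allowed_penultimate:
        return None
    return peptide[:2], peptide[2:]


# Assumed GLP-1(7-36) sequence, used only as an example substrate.
GLP1_7_36 = "HAEGTFTSDVSSYLEGQAAKEFIAWLVKGR"

result = cleave_n_terminal_dipeptide(GLP1_7_36)
if result is not None:
    dipeptide, truncated = result
    print(f"released {dipeptide}, remaining {len(truncated)} residues")
```

Under these assumptions the released dipeptide is His-Ala, matching the cleavage described for GLP-1.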

Various different inhibitors of DPPIV were known in 1993. One of these is a
suicide inhibitor, N-Ala-Pro-O-(nitrobenzoyl)hydroxylamine. Another is a competitive
inhibitor, ε-(4-nitro)benzoxycarbonyl-Lys-Pro, and another is a polyclonal rabbit
anti-porcine kidney DPPIV immunoglobulin. Others have since been developed and are
described in detail in U.S. Patents Nos. 5,939,560, 6,110,949, 6,011,155 and 5,462,928.

In addition to, but independent of, its serine type catalytic activity, DPPIV binds 
closely to the soluble extracellular enzyme adenosine deaminase (ADA), acting as a 
receptor and is thought to mediate signal transduction. DPPIV structure is characterized
by two extracellular domains, an α/β fold hydrolase domain and a 7-blade beta-propeller
domain consisting of repeated beta sheets of about 50 amino acids. Recently it has been
shown that, besides selecting substrates by size, the beta-propeller domain, containing 10
of the 12 highly conserved cysteine residues, contributes to catalysis of the peptidase
domain. In addition, the cysteine-rich domain is responsible for DPPIV-binding to
collagen I and to extracellular ADA. DPPIV is also reported to play a role in
fibronectin-mediated interactions of cells with extracellular matrix. Recent studies show that the
protease activity of DPPIV is not required for its anti-invasive activity because mutants 
of DPPIV that lack the extracellular serine protease activity maintain such activity. 

A number of proteins that share similarities with DPPIV have been reported in 
the literature. Several of these proteins have been cloned including DPP-I, DPP-II,
DPP-III, DPP-X and fibroblast activation protein (FAP). These have been identified and
characterised either by molecular cloning and functional studies of expressed proteins or
as biochemical activities in tissue extracts. DPPIV-beta and other novel peptidases with
functional similarities to DPPIV are not yet cloned. The identification, characterization
and/or appropriate classification of further members of the family of prolyl
oligopeptidases, the elucidation of their physiological (and particularly 
pathophysiological) role, and the application of that knowledge to the development of 
new therapeutic agents are significant challenges. 

Summary of the Invention 

The present invention provides proteins with prolyl oligopeptidase (post-proline
cleaving) activities that constitute three novel members of a family of proteins related to
DPPIV, including the full-length proteins, alternative splice forms, subunits, and 
mutants, as well as nucleotide sequences encoding the same. The present invention also 
provides methods of screening for substrates, interacting proteins, agonists, antagonists
or inhibitors of the above proteins, and furthermore provides pharmaceutical compositions
comprising the proteins and/or mutants, derivatives and/or analogues thereof and/or 
ligands thereto. 

These novel proteins having significant sequence homology to DPPIV are termed 
dipeptidyl peptidase IV-related proteins 1, 2 and 3 (DPRP-1, DPRP-2 and DPRP-3). The
amino acid sequences of DPRP-1, DPRP-2 and DPRP-3 are given in SEQ. ID NOS: 1, 3
and 5 respectively. Further disclosed are nucleic acid sequences coding for these 
proteins (SEQ. ID NOS:2, 4 and 6). Table 1 illustrates the homology (i.e. similarity) 
between the novel proteins DPRP-1, DPRP-2 and DPRP-3 and other known serine 
proteases. 






WO 02/31134 



PCT/US01/31874 



Table 1 - Comparison of the sequences of these three novel proteins with DPPIV 
and other Clan SC, Family S9 members and Subfamily B members 



[Table 1 survives only in part. Recoverable entries: DPPIV, 766 amino acids; DPRP-1, gene locus 15q22.1-15q22.2; DPRP-2, 864 amino acids, 39% similarity to DPPIV; DPRP-3, 796 amino acids, gene locus 2q12.3-2q14.1. One further entry, 7.5-8.0, has lost its row and column labels.]





The greatest homology between DPRP-1, DPRP-2 and DPPIV is seen in the
C-terminal sequences. On the basis of sequence homology with DPPIV (see Figure 1), one
might predict that these DPRP proteins would have functions that include, but are not
limited to, roles as enzymes. Cloning, expression, biochemical and molecular 
characterization have confirmed this hypothesis. 

The expression pattern of DPRPs and the localization to specialized epithelial 
cells and plasma cells (Leydig cells, prostate epithelial cells, lymphocytes, B cells) is
consistent with a role in differentiation, proliferation and inflammation. The localization
of the DPRP-1 gene in hormone sensitive cancers (breast, prostate, testicular), tissues
regulated by testosterone and the abundant expression in poorly differentiated cancers,
demonstrate that DPRP-activating or inhibiting molecules will have numerous
therapeutic applications in the treatment of disorders characterized by dysregulated
growth, differentiation and steroid or polypeptide hormone synthesis and degradation.
Data disclosed herein support the hypothesis that DPRP-1 and DPRP-2 are involved in
the regulation of proliferation of in vitro models of prostate and testis cancer well known 
to those skilled in the art. 

DPRP-1 and DPRP-2 activities described herein and their expression patterns are
compatible with their having functional roles as physiological regulators of the immune
and neuroendocrine systems through the enzymatic modification of biochemical
mediators like peptides and chemokines. The numerous functions previously described
for DPPIV based upon the use of inhibitors may be due in part to its action and that of
similar proteins, like the DPRPs. Therefore, the discovery of selective and potent
inhibitors of DPPIV, of the DPRPs and of other related proteases like FAP is considered
central to achieving effective and safe pharmaceutical use of these and any newly
identified serine protease inhibitors, as well as other active compounds that modify the
function(s) of such proteins. 

The invention thus provides novel proteins or polypeptides, the nucleic acids 
coding therefor, cells which have been modified with the nucleic acid so as to express 
these proteins, antibodies to these proteins, a screening method for the discovery of new
therapeutic agents which are inhibitors of the activity of these proteins (or which are
inhibitors of DPPIV and not of the proteins), and therapeutic agents discovered by such
screening methods. The novel proteins and the nucleic acids coding therefor can be used
to discover new therapeutic agents for the treatment of certain diseases, such as, for
example, reproductive, inflammatory and metabolic disorders, and also in the preparation
of antibodies with therapeutic or diagnostic value.

In accordance with one aspect of the present invention, there are provided novel, 
mature, biologically active proteins, principally of human origin. Such proteins may be 
isolated in small quantities from suitable animal (including human) tissue or biological 
fluids by standard techniques; however, larger quantities are more conveniently prepared
in cultures of cells genetically modified so as to express the protein.

In accordance with another aspect of the present invention, there are provided 
isolated nucleic acid molecules encoding polypeptides of the present invention including 
mRNAs, DNAs, cDNAs and genomic DNAs thereof.

In accordance with a further aspect of the present invention, nucleic acid probes
are also provided comprising nucleic acid molecules of sufficient length to specifically
hybridize to a nucleic acid sequence of the present invention. 

In accordance with a still further aspect of the present invention, processes 
utilizing recombinant techniques are provided for producing such polypeptides useful for 
in vitro scientific research, for example, synthesis of DNA and manufacture of DNA
vectors. Processes for producing such polypeptides include culturing recombinant
prokaryotic and/or eukaryotic host cells that have been transfected with DNA vectors 
containing a nucleic acid sequence encoding such a polypeptide and/or the mature 
protein under conditions promoting expression of such protein and subsequent recovery 
of such protein or a fragment of the expressed product.

In accordance with still another aspect, the invention provides methods for using
DPRP polypeptides and polynucleotides, including the treatment of infections, such as
bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV-1
or HIV-2, pain, diabetes, precocious puberty, infertility, obesity, anorexia, bulimia,
Parkinson's disease, acute heart failure, hypotension, hypertension, urinary retention,
osteoporosis, angina pectoris, myocardial infarction, stroke, ulcers, asthma, allergies,
benign prostatic hypertrophy, cancers including hormone-sensitive and
androgen-independent cancers, migraines, vomiting, psychotic and neurological disorders,
including anxiety, schizophrenia, manic depression, depression, dementia, and severe
mental retardation, and dyskinesias, hereinafter collectively referred to as "the Diseases".

In accordance with yet another aspect of the present invention, there is provided a 
process for utilizing such polypeptides, or polynucleotides encoding such polypeptides, 
for the discovery of compounds that inhibit the biological activity of the mature proteins 
thereof, e.g. by cleaving an N-terminal dipeptide, and such inhibitors are thus also
provided.

In accordance with a more specific aspect, the invention provides isolated nucleic
acid which encodes (a) a polypeptide which includes the amino acid sequence of one of
SEQ ID NOS: 1, 3 and 5, or (b) a polypeptide having an amino acid sequence that is at
least about 70% similar thereto and exhibits the same biological function, or which is an
alternative splice variant of one of SEQ ID NOS:2, 4 and 6, or which is a probe
comprising at least 14 contiguous nucleotides from said nucleic acid encoding (a) or (b),
or which is complementary to any one of the foregoing. 
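
The probe definition above (at least 14 contiguous nucleotides, or the complement thereof) can be sketched in Python as follows. The helper names `candidate_probes` and `reverse_complement` are hypothetical, and the short DNA string is an invented placeholder, not part of SEQ ID NOS:2, 4 or 6. The sketch enumerates windows only; it does not assess hybridization specificity.

```python
# Translation table for Watson-Crick complements of DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(seq, length=14):
    """List every contiguous window of the given length from seq."""
    seq = seq.upper()
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


# Invented 24-nucleotide placeholder sequence for illustration only.
example_dna = "ATGGCTAGCTAGGATCCGATCGTA"

probes = candidate_probes(example_dna, length=14)
antisense_probes = [reverse_complement(p) for p in probes]
print(len(probes), probes[0], antisense_probes[0])
```

A sequence of n nucleotides yields n - 13 windows of length 14, so the 24-nucleotide placeholder gives 11 candidate probes in each orientation.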

In accordance with another specific aspect, the invention provides a polypeptide 
which may be optionally glycosylated, and which (a) has the amino acid sequence of a
mature protein set forth in any one of SEQ ID NOS: 1, 3 and 5; (b) has the amino acid
sequence of a mature protein having at least about 70% similarity to one of the mature
proteins of (a) and which exhibits the same biological function; (c) has the amino acid
sequence of a mature protein having at least about 90% identity with a mature protein of
any of SEQ ID NOS:1, 3 and 5; or (d) is an immunologically reactive fragment of (a).

In accordance with still another specific aspect, the invention provides a method
of screening for a compound capable of inhibiting the enzymatic activity of at least
one mature protein of the invention, which method comprises incubating said mature 
protein and a suitable substrate for said mature protein in the presence of one or more 
test compounds or salts thereof, measuring the enzymatic activity of said mature protein,
comparing said activity with comparable activity determined in the absence of a test
compound, and selecting the test compound or compounds that reduce the enzymatic
activity, and it also provides a method for screening for a compound capable of 
inhibiting the enzymatic activity of DPPIV that does not inhibit the enzymatic activity of 
at least one mature protein of the invention, which method comprises incubating said mature
protein and a suitable substrate in the presence of one or more
inhibitors of DPPIV or salts thereof, measuring the enzymatic activity of said mature
protein, comparing said activity with comparable activity determined in the absence of 
the DPPIV inhibitor, and selecting a compound that does not reduce the enzymatic 
activity of said mature protein. 
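
The two screening methods above reduce to a comparison of enzymatic activity with and without a compound. The following Python sketch shows one way that comparison could be expressed. All function names, compound names, activity readings and the 50% and 10% cut-offs are hypothetical values chosen for illustration; the text does not specify thresholds.

```python
def percent_inhibition(activity_with_compound, control_activity):
    """Percent reduction in activity relative to the no-compound control."""
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (control_activity - activity_with_compound) / control_activity


def select_inhibitors(activities, control_activity, min_inhibition=50.0):
    """Names of compounds that reduce activity by at least min_inhibition percent."""
    return [
        name
        for name, activity in activities.items()
        if percent_inhibition(activity, control_activity) >= min_inhibition
    ]


def select_dppiv_selective(dppiv_hits, dprp_activities, dprp_control,
                           max_dprp_inhibition=10.0):
    """DPPIV inhibitors that leave the DPRP activity essentially unchanged."""
    return [
        name
        for name in dppiv_hits
        if percent_inhibition(dprp_activities[name], dprp_control) <= max_dprp_inhibition
    ]


# Hypothetical activity readings in arbitrary units (control = 100).
dppiv_activities = {"cmpd_A": 20.0, "cmpd_B": 90.0, "cmpd_C": 40.0}
dprp1_activities = {"cmpd_A": 95.0, "cmpd_B": 98.0, "cmpd_C": 30.0}

dppiv_hits = select_inhibitors(dppiv_activities, control_activity=100.0)
selective = select_dppiv_selective(dppiv_hits, dprp1_activities, dprp_control=100.0)
print(dppiv_hits, selective)
```

With these invented readings, two compounds inhibit DPPIV but only one of them spares the DPRP activity, which is the kind of compound the second method is intended to select.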

These and other aspects of the present invention should be apparent to those 
skilled in the art from the detailed description which follows.

Brief Description of the Drawings

FIGS. 1A and 1B show the co-linear alignment of DPRP-1, DPRP-2, DPRP-3
and DPPIV, with shading being supplied to indicate the same (black) or similar (gray)
amino acid residues at a particular location.

FIG. 2 is similar to FIG. 1 and shows co-linear alignment of human and mouse
DPRP-2.

FIG. 3 is a graph which shows the effects of various tetrapeptide amide inhibitors 
on dipeptidyl peptidase enzyme activity. 

FIGS. 4A-4C show the effects of three inhibitor compounds on the proliferation 
of PC3 prostate cancer cell lines at various doses.

Detailed Description of the Preferred Embodiments

In accordance with an aspect of the present invention, there are provided isolated
nucleic acid sequences (polynucleotides), which encode the mature polypeptides having
the deduced amino acid sequences of the three DPRPs (SEQ ID NOS:1, 3 and 5).

The polynucleotides of this invention were discovered using a human testis
cDNA library (DPRP-1), a human colon library (DPRP-2) and a human hypothalamus
cDNA library (DPRP-3). Isolated nucleic acid for DPRP-1 contains an open reading
frame encoding a protein of approximately 882 amino acids in length which is
structurally related to human DPPIV, showing 26% identity, and 41% similarity over the
entire human DPPIV protein sequence. Isolated nucleic acid for DPRP-2 contains an
open reading frame encoding a protein of approximately 864 amino acids, which is
39% similar to the entire DPPIV amino acid sequence. Analysis of DPRP-1 and
DPRP-2 primary amino acid sequence using hydrophobicity plots predicts that these two
proteins do not have a transmembrane domain. Despite this fact, it is possible that these
intracellular serine proteases are secreted upon cellular activation. Quiescent cell proline
dipeptidase (QPP) is a serine protease that is targeted to intracellular vesicles that are
distinct from lysosomes (Chiravuri M, et al., J. Immunol. 2000 Nov 15;165(10):5695-702).
This hypothesis expands the potential site(s) and scope of DPRP-1 and DPRP-2
involvement in mechanisms for post-translational regulation of chemokines, cytokines,
peptides and polypeptides. The full length DPRP-3 sequence contains 796 amino acids,
a signal peptide from 1 to 48, and a transmembrane domain between 34 and 56. The
mature protein is predicted to be a type II membrane protein and may be cleaved to
produce a soluble form. The amino acid sequence is set forth in SEQ ID NO:5, which
was deduced from SEQ ID NO:6 and has 54% similarity with DPPIV.

Amino acid sequence alignments of these polypeptides with members of the
prolyl oligopeptidase enzyme subfamily S9B show that all three DPRP proteins have
overall sequence and structural homology to DPPIV and FAP. DPRPs are predicted to 
be members of the enzyme Clan SC (Serine nucleophile) with catalytic residues in the
order Ser, Asp, His and the active site sequence (G-W-S-Y-G).

Table 2. Homology (i.e. similarity) between DPRP-1, DPRP-2, DPRP-3 and members
of the prolyl oligopeptidase family S9B enzymes.

DPRP-1, DPRP-2 and DPRP-3 do not exhibit sequence similarity with any
members of the classical serine protease families, chymotrypsin and subtilisin. The order
of the catalytic triad residues is different in the three main related SC clan families:
His-Asp-Ser in chymotrypsin, Asp-His-Ser in subtilisin and Ser-Asp-His in the prolyl
oligopeptidases. 
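
The distinction drawn above depends only on the order in which the three catalytic residues occur along the primary sequence. A minimal Python sketch of that ordering test follows. The function names are hypothetical, and the residue positions used in the example (chymotrypsin His57/Asp102/Ser195, subtilisin Asp32/His64/Ser221, human DPPIV Ser630/Asp708/His740) are commonly cited literature numberings supplied here as assumptions; they are not stated in this document.

```python
# Order of catalytic residues along the sequence, as given in the text.
TRIAD_ORDER_TO_FAMILY = {
    ("His", "Asp", "Ser"): "chymotrypsin",
    ("Asp", "His", "Ser"): "subtilisin",
    ("Ser", "Asp", "His"): "prolyl oligopeptidase",
}


def triad_order(positions):
    """Return residue names sorted by their position in the sequence."""
    return tuple(sorted(positions, key=positions.get))


def classify_by_triad(positions):
    """Map a {residue: position} dict to a family name, or 'unclassified'."""
    return TRIAD_ORDER_TO_FAMILY.get(triad_order(positions), "unclassified")


# Assumed literature numberings, for illustration only.
examples = {
    "chymotrypsin": {"His": 57, "Asp": 102, "Ser": 195},
    "subtilisin": {"Asp": 32, "His": 64, "Ser": 221},
    "DPPIV": {"Ser": 630, "Asp": 708, "His": 740},
}

for enzyme, positions in examples.items():
    print(enzyme, "->", classify_by_triad(positions))
```

The ordering alone cannot confirm activity; as the following paragraphs note for DPPVI and DPRP-3, a protein can retain the fold while lacking the catalytic serine.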

As shown in Table 2, DPRP-3 has the highest homology with DPPVI (68%
homology and 51% identity). Wada et al. isolated cDNA clones for DPPVI, a
DPPIV-related protein, from bovine, rat (Wada et al., Proc. Nat. Acad. Sci. 89: 197-201 (1992))
and human (Yokotani et al., Hum. Molec. Genet. 2: 1037-1039 (1993)) brain libraries.
They demonstrated that, unlike DPPIV, the catalytic triad in DPPVI does not have the
first serine residue. In DPRP-3 two of the amino acids in the catalytic triad
characteristic of the serine protease family are conserved. However, the serine residue
itself is replaced by glycine. While the absence of the serine residue is likely to prevent
protease activity at this site, it is possible that multiple other functions mediated by other
functional domains of the protein remain intact.

As briefly described above, DPPIV is a multifunctional molecule that exerts
important functions depending on the cells and tissues in which it is expressed, in addition to its
catalytic activity as a peptidase. DPRP-3 and DPPVI are also likely to maintain multiple 
functions despite the absence of an intact catalytic triad. For example, DPPVI has been 
implicated in the regulation of neuronal plasticity. DPPVI is highly expressed in the
hippocampus, thalamus, hypothalamus and striatum. In addition, developmental arrest
and embryonic lethality of rump white Rw/Rw embryos is thought to be due to disruption
of the DPPVI gene. The Rw mutation is associated with a chromosomal inversion spanning
30 cM of the proximal portion of mouse chromosome 5. Genomic analysis of the DPPVI
gene on the Rw chromosome places the inversion breakpoint in the coding region,
resulting in loss of a significant fraction of the C-terminal region (Hough R.B. et al.,
Proc. Nat. Acad. Sci. 95: 13800-13805 (1998)).

The human DPRP-1 gene, predicted to be 32668 bp in length, has at least 22
exons and eight transcripts. It maps to chromosome 15 (NT_010265) at position
15q21.1 - 15q22.1. The lengths of predicted alternative splice variant transcripts vary
between 602 bp and 4523 bp (see SEQ ID NOS: 7-22). This is in agreement with the
multiple transcripts observed by Northern blot analysis (see Example 2). ESTs
representing the transcripts were found in numerous tissues including senescent
fibroblasts, T-lymphocytes, germinal center B-cells, germ cell seminoma, testis,
melanocytes, uterus, ovary, breast, multiple sclerosis lesions, pancreas and placenta.

Human DPRP-2 belongs to a gene with at least 27 exons and nine splice variants
(see SEQ ID NOS:23-40). One SNP was observed in the 3' UTR (88% (37) C vs. 12%
(5) T). The DPRP-2 gene maps to region 19p13.3 of chromosome 19. This location is
host to a number of disease markers and is associated with various disorders including 
hypocalciuric hypercalcemia, type II cerebellar ataxia, muscular dystrophy, convulsions,
susceptibility to atherosclerosis, psoriasis, ectodermal dysplasia, and acute myeloid
leukemia. In agreement with the ubiquitous distribution of the mRNA observed by
Northern blot analysis (see Example 2), DPRP-2 was expressed in a wide variety of
tissues upon examination of EST coverage (e.g. over 64 ESTs expressed in liver,
spleen, muscle, melanocytes, heart, lung, placenta, skin, pancreas, stomach, brain and
parathyroid gland).

Human DPRP-3 belongs to a gene with at least 23 exons and two splice variants
(see SEQ ID NOS:41-44). The gene maps to chromosome 2 (NT_005445) at position
2q12.3-2q14.1. Transcripts for DPRP-3 did not show as wide a distribution as DPRP-1
and DPRP-2. As shown by Northern blot in Example 2, DPRP-3 expression is restricted
to brain and pancreas. ESTs representing the DPRP-3 mRNA were abundant in tissue
derived from multiple sclerosis lesions, hypothalamus, whole brain and nerves, with a 
few transcripts being found in uterus and colon. 

The relationships among human and rodent proteases in clan SC, including
DPRP-1, DPRP-2 and DPRP-3, were analyzed using the Neighbor Joining (NJ) method; see
Saitou and Nei, Mol. Biol. Evol. 4: 406-425 (1987). Phylogenetic analysis shows that
among the S9 proteases, DPRP-1 and DPRP-2, both lacking a transmembrane domain,
are distinguished from DPPIV and its closely related proteins like FAP. Similarity is
shown, however, between DPPIV and FAP and between DPRP-3 and DPPVI, which are
all type II membrane proteins. 

A database search for additional DPRP-related genes revealed the presence of a
murine sequence related to DPRP-1. Alignment of this mouse sequence with the novel
human proteases shows that the mDPRP-1 displays considerable homology with its
human counterpart (FIG. 2). One skilled in the art will readily recognize that the novel
mouse protease gene can be isolated using the sequence information disclosed herein and
can be readily incorporated into one of the routinely used expression constructs which
are well known in the art. Use of this disclosed sequence by those skilled in the art to
generate a transgenic mouse model will employ development of gene-targeting vectors,
for example, that result in homologous recombination in mouse embryonic stem cells.
The use of knockout mice in further analysis of the function of DPRP genes is a valuable
tool.

The polynucleotides of the present invention may be in the form of RNA or in 
the form of DNA; DNA should be understood to include cDNA, genomic DNA, and 
synthetic DNA. The DNA may be double-stranded or single-stranded and, if
single-stranded, may be the coding strand or non-coding (antisense) strand. The coding
sequence which encodes the mature polypeptide may be identical to the coding sequence
shown in SEQ ID NOS:2, 4 and 6 respectively, or it may be a different coding sequence 
encoding the same mature polypeptide, as a result of the redundancy or degeneracy of 
the genetic code or a single nucleotide polymorphism. For example, it may also be an 
RNA transcript which includes the entire length of any one of SEQ ID NOS:2, 4 and 6. 
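
The point about degeneracy of the genetic code can be made concrete with a short Python sketch: two different DNA coding sequences are translated with the standard genetic code and yield the same peptide. The function name `translate` and both DNA strings are hypothetical; the peptide chosen, G-W-S-Y-G, is the active-site motif mentioned in the text, and the codon choices are arbitrary illustrations rather than excerpts from SEQ ID NOS:2, 4 or 6.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence (length divisible by 3) to one-letter codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two hypothetical coding sequences that differ at several wobble positions.
coding_a = "GGTTGGTCTTATGGT"
coding_b = "GGCTGGAGCTACGGA"

print(translate(coding_a), translate(coding_b))
```

Both sequences encode the same five residues although they differ in nucleotide sequence, which is the sense in which a "different coding sequence encoding the same mature polypeptide" is meant.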

The polynucleotides which encode the mature proteins of SEQ ID NOS:1, 3, 5,
respectively, may include but are not limited to the coding sequence for the mature
protein alone; the coding sequence for the mature polypeptide plus additional coding
sequence, such as a leader or secretory sequence or a proprotein sequence; and the
coding sequence for the mature protein (and optionally additional coding sequence) plus
non-coding sequence, such as introns or a non-coding sequence 5' and/or 3' of the coding
sequence for the mature protein.

Thus, the term "polynucleotide encoding a polypeptide" or the term "nucleic acid 
encoding a polypeptide" should be understood to encompass a polynucleotide or nucleic
acid which includes only coding sequence for the mature protein as well as one which
includes additional coding and/or non-coding sequence. The terms polynucleotides and 
nucleic acid are used interchangeably. 

The present invention also includes polynucleotides where the coding sequence 
for the mature protein may be fused in the same reading frame to a polynucleotide
sequence which aids in expression and secretion of a polypeptide from a host cell; for
example, a leader sequence which functions as a secretory sequence for controlling
transport of a polypeptide from the cell may be so fused. The polypeptide having such a
leader sequence is termed a preprotein or a preproprotein and may have the leader
sequence cleaved by the host cell to form the mature form of the protein. These
polynucleotides may have a 5' extended region so that they encode a proprotein, which is
the mature protein plus additional amino acid residues at the N-terminus. The
expression product having such a prosequence is termed a proprotein, which is an
inactive form of the mature protein; however, once the prosequence is cleaved an active
mature protein remains. Thus, for example, the polynucleotides of the present invention
may encode mature proteins, or proteins having a prosequence, or proteins having both a
prosequence and a presequence (leader sequence). 

The polynucleotides of the present invention may also have the coding sequence 
fused in frame to a marker sequence which allows for purification of the polypeptides of 
the present invention. The marker sequence may be a polyhistidine tag, a hemagglutinin
(HA) tag, a c-myc tag or a V5 tag when a mammalian host, e.g. COS-1 cells, is used.
The HA tag would correspond to an epitope derived from the influenza hemagglutinin
protein (Wilson, I., et al., Cell, 37:767 (1984)), and the c-myc tag may be an epitope
from human Myc protein (Evans, G.I. et al., Mol. Cell. Biol. 5: 3610-3616 (1985)).

The term "gene" means the segment of DNA involved in producing a polypeptide
chain; it includes regions preceding and following the coding region (leader and trailer)
as well as intervening sequences (introns) between individual coding segments (exons).

The term "significant sequence homology" is intended to denote that at least 25%,
preferably at least 40%, of the amino acid residues are conserved, and that, of the non- 
conserved residues, at least 40% are conservative substitutions. 
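
By way of illustration only, the two-part test for "significant sequence homology" can be sketched in Python as follows. The sketch assumes the two amino acid sequences have already been aligned to equal length with "-" marking gaps, that gapped positions are ignored, and that conservative substitutions are those within the hypothetical residue groups listed in the code. None of these assumptions is taken from the present specification.

```python
# Hypothetical groupings of residues treated as conservative substitutions
# (an assumption for illustration; the specification does not define them).
CONSERVATIVE_GROUPS = [
    set("AVLIM"), set("FYW"), set("KRH"), set("DE"),
    set("STNQ"), set("GP"), set("C"),
]


def is_conservative(a: str, b: str) -> bool:
    """True if residues a and b fall in the same assumed group."""
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def has_significant_homology(aligned_a: str, aligned_b: str,
                             min_conserved: float = 0.25,
                             min_conservative: float = 0.40) -> bool:
    """Apply the two thresholds to a pre-aligned pair of sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to equal length")
    # Ignore positions where either sequence has a gap (assumption).
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b)
             if x != "-" and y != "-"]
    if not pairs:
        return False
    mismatches = [(x, y) for x, y in pairs if x != y]
    conserved_fraction = (len(pairs) - len(mismatches)) / len(pairs)
    if mismatches:
        conservative_fraction = (
            sum(is_conservative(x, y) for x, y in mismatches) / len(mismatches)
        )
    else:
        conservative_fraction = 1.0  # nothing to substitute
    return (conserved_fraction >= min_conserved
            and conservative_fraction >= min_conservative)
```

The default thresholds of 25% and 40% mirror the minimum figures given above; passing min_conserved=0.40 would apply the preferred level instead.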

Fragments of the full-length genes of the present invention may be used as a
hybridization probe for a cDNA library to isolate full-length cDNA as well as to isolate
other cDNAs which have significant sequence homology to the gene and will encode
proteins or polypeptides having similar biological activity or function. By similar
biological activity or function, for purposes of this application, is meant the ability to
cleave an N-terminal dipeptide having Ala or Pro as the penultimate residue or other
amino acids. Such a probe has at least 14 bases (at least 14 contiguous
nucleotides from one of SEQ ID NOS:2, 4 or 6), preferably at least 30 bases, and
may contain, for example, 50 or more bases. Such a probe may also be used to identify a
cDNA clone corresponding to a full-length transcript and/or a genomic clone or clones
that contain the complete gene, including regulatory and promoter regions, exons, and
introns. Labelled oligonucleotides having a sequence complementary to that of the gene
of the present invention are useful to screen a library of human cDNA, genomic DNA or
mRNA to locate members of the library to which the probe hybridizes. As an example, a
known DNA sequence may be used to synthesize an oligonucleotide probe which is then
used in screening a library to isolate the coding region of a gene of interest.

The present invention is considered to further provide polynucleotides which 
hybridize to the hereinabove-described sequences wherein there is at least 70%, 
preferably at least 90%, and more preferably at least 95% identity or similarity between 
the sequences, and thus encode proteins having similar biological activity. Moreover, as
known in the art, there is "similarity" between two polypeptides when the amino acid
sequences contain the same or conserved amino acid substitutes for each individual
residue in the sequence. Identity and similarity may be measured using sequence
analysis software (e.g., Sequence Analysis Software Package of the Genetics Computer
Group, University of Wisconsin Biotechnology Center, 1710 University Avenue,
Madison, WI 53705). The present invention particularly provides such polynucleotides
which hybridize under stringent conditions to the hereinabove-described
polynucleotides. As herein used, the term "stringent conditions" means conditions which
permit hybridization between polynucleotide sequences and the polynucleotide
sequences of SEQ ID NOS:2, 4 and 6 where there is at least about 70% identity.

Suitably stringent conditions can be defined by, e.g., the concentrations of salt or
formamide in the prehybridization and hybridization solutions, or by the hybridization
temperature, and are well known in the art. In particular, stringency can be increased by
reducing the concentration of salt, by increasing the concentration of formamide, and/or
by raising the hybridization temperature.

For example, hybridization under high stringency conditions may employ about
50% formamide at about 37°C to 42°C, whereas hybridization under reduced stringency
conditions might employ about 35% to 25% formamide at about 30°C to 35°C. One
particular set of conditions for hybridization under high stringency conditions employs
42°C, 50% formamide, 5x SSPE, 0.3% SDS, and 200 ng/ml sheared and denatured
salmon sperm DNA. For hybridization under reduced stringency, similar conditions as
described above may be used in 35% formamide at a reduced temperature of 35°C. The
temperature range corresponding to a particular level of stringency can be further
narrowed by calculating the purine to pyrimidine ratio of the nucleic acid of interest and
adjusting the temperature accordingly. Variations on the above ranges and conditions
are well known in the art. Preferably, hybridization should occur only if there is at least
95%, and more preferably at least 97%, identity between the sequences. The
polynucleotides which hybridize to the hereinabove described polynucleotides in a
preferred embodiment encode polypeptides which exhibit substantially the same
biological function or activity as the mature protein encoded by one of the cDNAs of
SEQ ID NOS:2, 4 and 6.
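
The qualitative effect of salt, formamide and temperature on stringency can be illustrated with a rough numerical sketch. The Python function below uses one commonly cited empirical approximation for the melting temperature of long DNA duplexes; the formula and its coefficients are an assumption brought in for illustration and are not taken from the present specification. Note that this approximation relies on G+C content rather than the purine to pyrimidine ratio mentioned above. The sodium concentration of about 0.75 M used for 5x SSPE, the 50% G+C content and the 500 bp length are likewise illustrative assumptions.

```python
import math


def estimate_tm_celsius(sodium_molar: float, percent_gc: float,
                        percent_formamide: float, length_bp: int) -> float:
    """Approximate duplex melting temperature (empirical rule of thumb,
    assumed here for illustration only)."""
    return (81.5
            + 16.6 * math.log10(sodium_molar)
            + 0.41 * percent_gc
            - 0.61 * percent_formamide
            - 500.0 / length_bp)


def stringency_margin(hyb_temp_c: float, **conditions) -> float:
    """Estimated Tm minus hybridization temperature.

    A smaller margin means hybridization is carried out closer to the
    melting temperature, i.e. under more stringent conditions.
    """
    return estimate_tm_celsius(**conditions) - hyb_temp_c


# Illustrative comparison of the two condition sets described in the text.
high = stringency_margin(42.0, sodium_molar=0.75, percent_gc=50.0,
                         percent_formamide=50.0, length_bp=500)
reduced = stringency_margin(35.0, sodium_molar=0.75, percent_gc=50.0,
                            percent_formamide=35.0, length_bp=500)
print(f"margin at 42 C / 50% formamide: {high:.1f} C")
print(f"margin at 35 C / 35% formamide: {reduced:.1f} C")
```

Under these assumptions the 42°C, 50% formamide conditions leave a smaller margin below the estimated melting temperature than the 35°C, 35% formamide conditions, consistent with their description as the more stringent set.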

As mentioned, a suitable polynucleotide probe may have at least 14 bases,
preferably 30 bases, and more preferably at least 50 bases, and will hybridize to a
polynucleotide of the present invention which has an identity thereto, as hereinabove
described, and which may or may not retain activity. For example, such polynucleotides
may be employed as a probe for hybridizing to the polynucleotides of SEQ ID NOS:2, 4
and 6 respectively, for example, for recovery of such a polynucleotide, or as a diagnostic
probe, or as a PCR primer. Thus, the present invention includes polynucleotides having
at least a 70% identity, preferably at least a 90% identity, and more preferably at least a
95% identity to a polynucleotide which encodes the polypeptides of SEQ ID NOS:1, 3
and 5 respectively, as well as fragments thereof, which fragments preferably have at
least 30 bases and more preferably at least 50 bases, and to polypeptides encoded by
such polynucleotides.

As is well known in the art, the genetic code is redundant in that certain amino
acids are coded for by more than one nucleotide triplet (codon), and the invention
includes those polynucleotide sequences which encode the same amino acids using a
different codon from that specifically exemplified in the sequences herein. Such a
polynucleotide sequence is referred to herein as an "equivalent" polynucleotide
sequence. The present invention further includes variants of the hereinabove described
polynucleotides which encode for fragments, such as part or all of the mature protein,
analogs and derivatives of one of the polypeptides having the deduced amino acid
sequence of SEQ ID NOS:1, 3 and 5 respectively. The variant forms of the
polynucleotides may be a naturally occurring allelic variant of the polynucleotides or a
non-naturally occurring variant of the polynucleotides. For example, the variant in the
nucleic acid may simply be a difference in codon sequence for the amino acid resulting
from the degeneracy of the genetic code, or there may be deletion variants, substitution
variants and addition or insertion variants. As known in the art, an allelic variant is an
alternative form of a polynucleotide sequence which may have a substitution, deletion or
addition of one or more nucleotides that does not substantially alter the biological
function of the encoded polypeptide.
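
The notion of an "equivalent" polynucleotide sequence can be illustrated with a short Python sketch that translates two coding sequences using the standard genetic code and compares the resulting amino acid sequences. The function names and the short example sequences are hypothetical and are not drawn from the sequence listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence ('*' marks a stop codon)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def are_equivalent(dna_a: str, dna_b: str) -> bool:
    """True if both sequences encode the same amino acid sequence."""
    return translate(dna_a) == translate(dna_b)


# Hypothetical example: GCT/GCC both encode Ala, AAA/AAG both encode Lys.
print(are_equivalent("ATGGCTAAA", "ATGGCCAAG"))
```

Two sequences that differ only by such synonymous codons are equivalent in the sense used above, whereas a change such as GCT to GAT (Ala to Asp) yields a substitution variant.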

The present invention further includes polypeptides which have the deduced
amino acid sequence of SEQ ID NOS:1, 3 and 5, as well as fragments, analogs and
derivatives of such polypeptides. The terms "fragment," "derivative" and "analog",
when referring to the polypeptides of SEQ ID NOS:1, 3 and 5, mean polypeptides that
retain essentially the same biological function or activity as such polypeptides. An
analog might, for example, include a proprotein which can be activated by cleavage of
the proprotein portion to produce an active mature protein. The polypeptides of the
present invention may be recombinant polypeptides, natural polypeptides or synthetic
polypeptides; however, they are preferably recombinant polypeptides, glycosylated or
unglycosylated.

The fragment, derivative or analog of a polypeptide of SEQ ID NOS:1, 3 and 5
respectively, may be (i) one in which one or more of the amino acid residues is
substituted with a conserved or non-conserved amino acid residue (preferably a
conserved amino acid residue) and such substituted amino acid residue may or may not
be one encoded by the genetic code, or (ii) one in which one or more of the amino acid
residues includes a substituent group, or (iii) one in which additional amino acids are
fused to the mature protein, such as a leader or secretory sequence or a sequence which
is employed for purification of the mature polypeptide or a proprotein sequence. Such
fragments, derivatives and analogs are deemed to be within the scope of those skilled in
the art to provide upon the basis of the teachings herein.

The polypeptides and polynucleotides of the present invention should be in an
isolated form, and preferably they are purified to substantial homogeneity or purity. By
substantial homogeneity is meant a purity of at least about 85%.

The term "isolated" is used to mean that the material has been removed from its
original environment (e.g., the natural environment if it is naturally occurring). For
example, a naturally occurring polynucleotide or polypeptide present in a living animal
is not considered to be isolated, but the same polynucleotide or polypeptide, when
separated from substantially all of the coexisting materials in the natural system, is
considered isolated. For DNA, the term includes, for example, a recombinant DNA
which is incorporated into a vector, into an autonomously replicating plasmid or virus, or
into the genomic DNA of a prokaryote or eukaryote; or which exists as a separate
molecule (e.g., a cDNA or a genomic or cDNA fragment produced by polymerase chain
reaction (PCR) or restriction endonuclease digestion) independent of other sequences. It
also includes a recombinant DNA which is part of a hybrid gene encoding additional
polypeptide sequence, e.g., a fusion protein. Further included is recombinant DNA
which includes a portion of the nucleotides shown in one of SEQ ID NOS:2, 4 or 6 which
encodes an alternative splice variant of the DPRP. Various alternative splice variants are
exemplified in SEQ ID NOS:8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38,
40, 42, 44 and 46.

The polypeptides of the present invention include any one of the polypeptides of
SEQ ID NOS:1, 3 and 5 (in particular the mature protein), as well as polypeptides which
have at least 70% similarity (e.g. preferably at least 60% and more preferably at least
70% identity) to one of the polypeptides of SEQ ID NOS:1, 3 and 5, more preferably at
least 90% similarity (e.g. preferably at least 90% identity) to one of the polypeptides of
SEQ ID NOS:1, 3 and 5, and most preferably at least 95% similarity (e.g. preferably at
least 95% identity) to one of the polypeptides of SEQ ID NOS:1, 3 and 5. Moreover,
they should preferably include exact portions of such polypeptides containing a sequence
of at least 30 amino acids, and more preferably at least 50 amino acids.
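
The "exact portion" criterion can be sketched, for illustration only, as a search for the longest contiguous stretch of residues shared by a candidate polypeptide and a reference polypeptide. The Python functions below use a simple dynamic-programming comparison; the function names and the short example sequences are hypothetical.

```python
def longest_shared_run(a: str, b: str) -> int:
    """Length of the longest contiguous stretch present in both sequences."""
    best = 0
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                # Extend the run that ended at the previous residue pair.
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def has_exact_portion(candidate: str, reference: str,
                      min_length: int = 30) -> bool:
    """True if candidate shares an exact stretch of min_length residues."""
    return longest_shared_run(candidate, reference) >= min_length


# Hypothetical sequences sharing the six-residue stretch "TAYIAK".
print(longest_shared_run("MKTAYIAKQR", "GGTAYIAKPP"))
```

A min_length of 30 corresponds to the preferred figure given above, and 50 to the more preferred figure.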

Fragments or portions of the polypeptides of the present invention may be 
employed as intermediates for producing the corresponding full-length polypeptides by 
peptide synthesis. Fragments or portions of the polynucleotides of the present invention 
may also be used to synthesize full-length polynucleotides of the present invention.

The present invention also includes vectors which include such polynucleotides, 
host cells which are genetically engineered with such vectors and the production of 
polypeptides by recombinant techniques using the foregoing. Host cells are genetically
engineered (transduced or transformed or transfected) with such vectors which may be,
for example, a cloning vector or an expression vector. The vector may be, for example,
in the form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be
cultured in conventional nutrient media modified as appropriate for activating promoters,
selecting transformants or amplifying the genes of the present invention. The culture
conditions, such as temperature, pH and the like, are those commonly used with the host
cell selected for expression, as well known to the ordinarily skilled artisan.

The polynucleotides of the present invention may be employed for producing 
polypeptides by recombinant techniques. Thus, for example, the polynucleotides may be
included in any one of a variety of expression vectors for expressing polypeptides. Such
vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g.,
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids;
vectors derived from combinations of plasmids and phage DNA; and viral DNA such as
vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any other vector may
be used as long as it is replicable and viable in the host.

The appropriate DNA sequence may be inserted into the vector by any of a 
variety of procedures. In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures well known in the art, which procedures 
are deemed to be within the scope of those skilled in this art.

The DNA sequence in the expression vector is operatively linked to an
appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As
representative examples of such promoters, there may be mentioned: LTR or SV40
promoter, the E. coli lac or trp, the phage lambda P_L promoter and other promoters
known to control expression of genes in prokaryotic or eukaryotic cells or their viruses.
The expression vector should also contain a ribosome binding site for translation
initiation and a transcription terminator. The vector may also include appropriate
sequences for amplifying expression. In addition, the expression vectors preferably
contain one or more selectable marker genes to provide a phenotypic trait for selection
of transformed host cells, such as dihydrofolate reductase or neomycin-resistance for
eukaryotic cell culture, or such as tetracycline- or ampicillin-resistance in E. coli.

The vector containing the appropriate DNA sequence as hereinabove described,
as well as an appropriate promoter or control sequence, may be employed to transform
an appropriate host to permit the host to express the protein. As representative examples
of appropriate hosts, there may be mentioned: bacterial cells, such as E. coli,
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells, such as
Drosophila S2 and Spodoptera Sf9; animal cells, such as CHO, COS or Bowes
melanoma; adenoviruses; plant cells, etc. The selection of an appropriate host is deemed
to be within the scope of those skilled in the art from the teachings herein.

Synthetic production of nucleic acid sequences is well known in the art as is
apparent from CLONTECH 95/96 Catalogue, pages 215-216, CLONTECH, 1020 East
Meadow Circle, Palo Alto, Calif. 94303. Thus, the present invention also includes
expression vectors useful for the production of the proteins of the present invention.

The present invention further includes recombinant constructs comprising one or
more of the sequences as broadly described above. The constructs may comprise a
vector, such as a plasmid or viral vector, into which a sequence of the invention has been
inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment,
the construct further comprises regulatory sequences, including, for example, a
promoter, operably linked to the sequence. Large numbers of suitable vectors and
promoters are known to those of skill in the art, and are commercially available. The
following vectors are provided by way of example: Bacterial: pQE70, pQE60, pQE-9
(Qiagen), pBS, pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16a,
pNH18A, pNH46A (Stratagene), ptrc99a, pKK223-3, pKK233-3, pDR540 and pRIT5
(Pharmacia); and Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene),
pSVK3, pBPV, pMSG, and pSVL (Pharmacia). However, any other suitable plasmid or
vector may be used as long as it is replicable and viable in the host.

Promoter regions can be selected from any desired gene using CAT
(chloramphenicol acetyl transferase) vectors or other vectors with selectable markers.
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial
promoters include lacI, lacZ, T3, T7, gpt, lambda P_R, P_L and trp. Eukaryotic
promoters include CMV immediate early, HSV thymidine kinase, early and late SV40,
LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector
and promoter is well within the level of ordinary skill in the art.

Components of the expression vector may generally include: 1) a neomycin
phosphotransferase (G418), or hygromycin B phosphotransferase (hyg) gene as a
selection marker, 2) an E. coli origin of replication, 3) a T7 and SP6 phage promoter
sequence, 4) lac operator sequences, 5) the lactose operon repressor gene (lacIq) and 6) a
multiple cloning site linker region. Such an origin of replication (oriC) may be derived
from pUC19 (LTI, Gaithersburg, Md.).

A nucleotide sequence encoding one of the polypeptides SEQ ID NOS:2, 4 and 6
having the appropriate restriction sites is generated, for example, according to the PCR
protocol described in Example 1 hereinafter, using PCR primers having restriction sites
for KpnI (as the 5' primer) and NotI or SacI (as the 3' primer) for DPRP-1, or sites for
HindIII (as the 5' primer) and NotI or BamHI (as the 3' primer) for DPRP-2. The PCR
inserts are gel-purified and digested with compatible restriction enzymes. The insert and
vector are ligated according to standard protocols.
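
The construction of primers carrying restriction sites can be illustrated with the following Python sketch. It is a minimal example under stated assumptions and does not reproduce the primers of Example 1: the annealing length of 18 nucleotides, the two-base "GC" clamp placed 5' of each site, the function names and the placeholder coding sequence are all hypothetical. Only the recognition sequences of the named enzymes are standard.

```python
# Recognition sequences of the restriction enzymes named in the text.
RESTRICTION_SITES = {
    "KpnI": "GGTACC",
    "NotI": "GCGGCCGC",
    "SacI": "GAGCTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def tailed_primers(coding_seq: str, five_prime_enzyme: str,
                   three_prime_enzyme: str, anneal_len: int = 18,
                   clamp: str = "GC") -> tuple:
    """Build a forward and a reverse primer, each with a 5' restriction site.

    anneal_len and clamp are illustrative assumptions, not values from
    the specification.
    """
    coding_seq = coding_seq.upper()
    if len(coding_seq) < anneal_len:
        raise ValueError("coding sequence shorter than annealing length")
    forward = (clamp + RESTRICTION_SITES[five_prime_enzyme]
               + coding_seq[:anneal_len])
    reverse = (clamp + RESTRICTION_SITES[three_prime_enzyme]
               + reverse_complement(coding_seq[-anneal_len:]))
    return forward, reverse


# Placeholder coding sequence (hypothetical; not a DPRP sequence).
PLACEHOLDER_CDS = "ATGGCTAAAGATGAATTTGGTCATATCAAACTGATGAACTAA"
fwd, rev = tailed_primers(PLACEHOLDER_CDS, "KpnI", "NotI")
print(fwd)
print(rev)
```

In practice one would also confirm that the chosen sites do not occur within the insert itself, since the digestion step described above would otherwise cut the coding sequence.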

In a further embodiment, the present invention provides host cells containing the
above-described constructs. The host cell can be a higher eukaryotic cell, such as a
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell
can be effected by calcium phosphate transfection, DEAE-Dextran mediated
transfection, lipofection or electroporation (Davis, L., Dibner, M., Battey, I., Basic
Methods in Molecular Biology, (1986)).

Such constructs in host cells are preferably used in a conventional manner to
produce the gene product encoded by the recombinant sequence. Alternatively, the
polypeptides of the invention can be synthetically produced by conventional peptide
synthesizers or by chemical ligation of suitable fragments thus prepared.

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other
cells under the control of appropriate promoters. Cell-free translation systems can also
be employed to produce such proteins using RNAs derived from the DNA constructs of
the present invention. Appropriate cloning and expression vectors for use with
prokaryotic and eukaryotic hosts are described by Sambrook, et al., Molecular Cloning:
A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989).

Transcription of the DNA encoding the polypeptides of the present invention by
higher eukaryotes is increased by inserting an enhancer sequence into the vector.
Enhancers include cis-acting elements of DNA, usually from about 10 to 300 bp, that act
on a promoter to increase its transcription. Examples include the SV40 enhancer on the
late side of the replication origin (bp 100 to 270), a cytomegalovirus early promoter
enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus
enhancers.

Generally, recombinant expression vectors will include origins of replication and
selectable markers permitting transformation of the host cell, e.g., the
ampicillin-resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived
from a highly expressed gene to direct transcription of a downstream structural sequence.
Such promoters can be derived from operons encoding glycolytic enzymes, such as
3-phosphoglycerate kinase (PGK), alpha-factor, acid phosphatase, or heat shock proteins,
among others. The heterologous structural sequence is assembled in appropriate phase
with translation initiation and termination sequences, and preferably, a leader sequence
capable of directing secretion of translated protein into the periplasmic space or
extracellular medium. Optionally, the heterologous sequence can encode a fusion
protein including an N-terminal identification peptide imparting desired characteristics,
e.g., stabilization or simplified purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by inserting a
structural DNA sequence encoding a desired protein together with suitable translation
initiation and termination signals in operable reading phase with a functional promoter.
The vector will comprise one or more phenotypic selectable markers and an origin of
replication to ensure maintenance of the vector and to, if desired, provide amplification
within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus
subtilis, Salmonella typhimurium and various species within the genera Pseudomonas,
Streptomyces, and Staphylococcus, although others may also be employed as a matter of
choice.

As a representative but non-limiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example,
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec,
Madison, Wis., U.S.A.). These pBR322 "backbone" sections are combined with an 
appropriate promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to
an appropriate cell density, the selected promoter is induced by appropriate means (e.g.,
temperature shift or chemical induction), and cells are cultured for an additional period.
Cells are typically harvested by centrifugation and then disrupted by physical or
chemical means, with the resulting crude extract being retained for further purification.
Microbial cells employed in expression of proteins can be disrupted by any convenient
method, including freeze-thaw cycling, sonication, mechanical disruption and use of
cell-lysing agents; such methods are well known to those skilled in the art.

Various mammalian cell culture systems can also be employed to express a
recombinant protein. Examples of mammalian expression systems include the COS-7
lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other
cell lines capable of expressing a compatible vector include, for example, the C127, 3T3,
CHO, HeLa and BHK cell lines. Mammalian expression vectors will generally comprise
an origin of replication, a suitable promoter and enhancer, and also any necessary
ribosome binding sites, polyadenylation site, splice donor and acceptor sites,
transcriptional termination sequences, and 5' flanking nontranscribed sequences. DNA
sequences derived from the SV40 splice and polyadenylation sites may be used to
provide required nontranscribed genetic elements.

The polypeptides can be recovered and purified from recombinant cell cultures 
by methods including ammonium sulfate or ethanol precipitation, acid extraction, anion 
or cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography 
and lectin chromatography. Recovery can be facilitated if the polypeptide is expressed 
at the surface of the cells, but such is not a prerequisite. Recovery may also be desirable 
of cleavage products that are cleaved following expression of a longer form of the 
polypeptide. Protein refolding steps as known in this art can be used, as necessary, to 
complete configuration of the mature protein. High performance liquid chromatography 
(HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be purified natural products, or 
produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, 
by bacterial, yeast, higher plant, insect or mammalian cells in culture). Depending upon 
the host employed in a recombinant production procedure, the polypeptides of the 
present invention may be glycosylated or may be non-glycosylated. Polypeptides of the 
invention may also include an initial methionine amino acid residue. 

In a preferred embodiment, the proteins of the invention are isolated and purified 
so as to be substantially free of contamination from other proteins. For example, the 
proteins of the invention should constitute at least 80% by weight of the total protein 
present in a sample, more preferably at least 90%, even more preferably at least 95%, 
and most preferably at least 98% by weight of the total protein. 

These proteins may be in the form of a solution in water, another suitable solvent, 
such as dimethyl sulphoxide (DMSO) or ethanol, or a mixture of suitable solvents. 
Examples of mixtures of solvents include 10% (by weight) ethanol in water and 2% (by 
weight) DMSO in water. A solution may further comprise salts, buffering agents, 
chaotropic agents, detergents, preservatives and the like. Alternatively, the proteins may 
be in the form of a solid, such as a lyophilised powder or a crystalline solid, which may 
also comprise a residual solvent, a salt or the like. 

As used herein, the term "antibodies" includes polyclonal antibodies, 
affinity-purified polyclonal antibodies, monoclonal antibodies, and antigen-binding
fragments, such as F(ab')2 and Fab' proteolytic fragments. Genetically engineered intact
antibodies or fragments, such as chimeric antibodies, Fv fragments, single chain
antibodies and the like, as well as synthetic antigen-binding peptides and polypeptides,
are also included. Non-human antibodies may be humanized by grafting non-human
CDRs onto human framework and constant regions, or by incorporating the entire
non-human variable domains (optionally "cloaking" them with a human-like surface by
replacement of exposed residues, wherein the result is a "veneered" antibody). In some
instances, humanized antibodies may retain non-human residues within the human
variable region framework domains to enhance proper binding characteristics. Through
humanizing antibodies, biological half-life may be increased, and the potential for
adverse immune reactions upon administration to humans should be reduced.

Alternative techniques for generating or selecting antibodies useful herein
include in vitro exposure of lymphocytes to human prohormone DPRP protein or a
peptide therefrom, and selection of antibody display libraries in phage or similar vectors
(for instance, through use of immobilized or labeled human DPRP protein or peptide).
Genes encoding polypeptides having potential human DPRP polypeptide binding
domains can be obtained by screening random peptide libraries displayed on phage
(phage display) or on bacteria, such as E. coli. Nucleotide sequences encoding such
polypeptides can be obtained in a number of ways well known in this art.

As would be evident to one of ordinary skill in the art, polyclonal antibodies can
be generated by inoculating a variety of warm-blooded animals, such as horses, cows,
goats, sheep, dogs, chickens, rabbits, mice and rats, with a human DPRP polypeptide or
a fragment thereof. The immunogenicity of a human prohormone DPRP polypeptide
may be increased through the use of an adjuvant, such as alum (aluminum hydroxide) or
Freund's complete or incomplete adjuvant, or surface active substances, such as
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH or
dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and
Corynebacterium parvum are especially preferable. Polypeptides useful for
immunization also include fusion polypeptides, such as fusions of DPRP or a portion
thereof with an immunoglobulin polypeptide or with maltose binding protein. The
polypeptide immunogen may be a full-length molecule or a portion thereof. If the
polypeptide portion is "hapten-like", such portion may be advantageously joined or
linked to a macromolecular carrier, such as keyhole limpet hemocyanin (KLH), bovine
serum albumin (BSA) or tetanus toxoid, for immunization. Antibodies to DPRP may
also be generated using methods that are well known in the art. Such antibodies may
include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain
antibodies, Fab fragments, and fragments produced by a Fab expression library.
Neutralizing antibodies (i.e., those which block or modify interactions at the active sites)
are especially preferred for therapeutic use.

For the production of antibodies, binding proteins, or peptides which bind
specifically to DPRP, libraries of single chain antibodies, Fab fragments, other antibody
fragments, non-antibody protein domains, or peptides may be screened. The libraries
could be generated using phage display, other recombinant DNA methods, or peptide
synthesis (Vaughan, T. J. et al. Nature Biotechnology 14:309-314 (1996)). Such
libraries would commonly be screened using methods which are well known in the art to
identify sequences which demonstrate specific binding to DPRP.

It is preferred that the oligopeptides, peptides, or fragments used to induce 
antibodies to DPRP have an amino acid sequence consisting of at least about 5 amino 
acids and, more preferably, of at least about 10 amino acids. It is also preferable that
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid
sequence of the natural protein. Short stretches of DPRP amino acids may also be fused 
with those of another protein, such as KLH, and antibodies to the chimeric molecule may 
be produced. 

Monoclonal antibodies to DPRP may be prepared using any well known
technique which provides for the production of antibody molecules by continuous cell
lines in culture. These include, but are not limited to, the hybridoma technique, the
human B-cell hybridoma technique, and the EBV-hybridoma technique, although
monoclonal antibodies produced by hybridoma cells may be preferred.

In addition, techniques developed for the production of "chimeric antibodies",
such as the splicing of mouse antibody genes to human antibody genes to obtain a
molecule with appropriate antigen specificity and biological activity, can be used; see
Neuberger, M.S. et al. Nature 312:604-608 (1984). Alternatively, techniques described
for the production of single chain antibodies may be adapted, using methods known in
the art, to produce DPRP-specific single chain antibodies. Antibodies with related
specificity, but of distinct idiotypic composition, may be generated by chain shuffling
from random combinatorial immunoglobulin libraries (Burton D. R. Proc. Natl. Acad.
Sci. 88:11120-11123 (1991)).

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of highly 
specific binding reagents as disclosed in the literature (Orlandi, R. et al. Proc. Natl.
Acad. Sci. 86: 3833-3837 (1989)).

Antibody fragments which contain specific binding sites for DPRP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2
fragments produced by pepsin digestion of the antibody molecule and Fab fragments
generated by reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab
expression libraries may be constructed to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity (Huse, W. D. et al. Science 246:
1275-1281 (1989)).

Various immunoassays may be used to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays
using either polyclonal or monoclonal antibodies with established specificities are well
known in the art. Such immunoassays typically involve the measurement of complex
formation between DPRP and its specific antibody. A two-site, monoclonal-based
immunoassay utilizing monoclonal antibodies reactive to two non-interfering DPRP
epitopes is preferred, but a competitive binding assay may also be employed. 

As earlier mentioned, the DPRPs can be used in treatment of the Diseases. 
Pharmaceutical compositions suitable for use in this aspect of the invention include 
compositions wherein the active ingredients are contained in an effective amount to
achieve the intended purpose relating to one of the Diseases. The determination of a
therapeutically effective dose is well within the capability of those skilled in the art and 
can be estimated initially either in cell culture assays, e.g. of neoplastic cells, or in 
animal models, usually mice, rats, rabbits, dogs, or pigs. An animal model may also be 
used to determine the appropriate concentration range and route of administration, which
information is then commonly used to determine useful doses and routes for
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, e.g. a
DPRP or fragment thereof, antibodies of DPRP, or an agonist, antagonist or inhibitor of
DPRP, which ameliorates particular symptoms or conditions of the Disease. For
example, the amount to be administered may be effective to cleave a desired target
substrate upon contact therewith. Therapeutic efficacy and toxicity may likewise be
determined by standard pharmaceutical procedures in cell cultures or with experimental
animals, such as by calculating the ED50 (the dose therapeutically effective in 50% of
the population) or LD50 (the dose lethal to 50% of the population) statistics. The dose
ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed as the
LD50/ED50 ratio. Pharmaceutical compositions which exhibit large therapeutic indices
are preferred. The data obtained from cell culture assays and animal studies are used in
formulating a range of dosage for human use. The dosage contained in such
compositions is preferably within a range of circulating concentrations that includes the
ED50 with little or no toxicity. The dosage varies within this range depending upon the
dosage form employed, the sensitivity of the patient, and the route of administration.
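
By way of illustration only, the therapeutic index defined above as the LD50/ED50 ratio may be computed as in the following minimal sketch. The function name, the compound names and the dose values are hypothetical and are not taken from any experiment described herein.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the LD50/ED50 ratio; both doses must be in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical (LD50, ED50) pairs in mg/kg, for illustration only.
candidates = {
    "compound_A": (500.0, 5.0),
    "compound_B": (120.0, 10.0),
}

# Compositions with larger therapeutic indices are preferred, so rank
# the candidates from the largest index to the smallest.
ranked = sorted(
    candidates,
    key=lambda name: therapeutic_index(*candidates[name]),
    reverse=True,
)
```

In this sketch compound_A would be preferred, since its lethal dose lies further above its effective dose.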

An exact dosage will normally be determined by the medical practitioner in light 
of factors related to the subject requiring treatment, with dosage and administration 
being adjusted to provide a sufficient level of the active moiety or to maintain a desired 
effect. Factors to be taken into account include the severity of the disease state, the
general health of the subject, the age, weight, and gender of the subject, diet, time and 
frequency of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or even once every two weeks, depending 
on the half-life and clearance rate of the particular formulation.

Yet another aspect of the invention provides polynucleotide molecules having 
sequences that are antisense to mRNA transcripts of DPRP-1, DPRP-2 and DPRP-3
polynucleotides. Administration of an antisense polynucleotide molecule can block the
production of the protein encoded by DPRP-1, DPRP-2 or DPRP-3. The techniques for
preparing antisense polynucleotide molecules and administering such molecules are
known in the art. For example, antisense polynucleotide molecules can be encapsulated 
into liposomes for fusion with cells. 

In particular, the expression of DPRP-1, DPRP-2 and DPRP-3 in specialized 
epithelial cells, immune cells (lymphocytes and B cells), astrocytic tumors, and in 
various hormone sensitive cancers provides evidence of a potential role in the
pathophysiology of cancer, metaplasia and metastasis. Therefore in a further aspect, the
invention relates to diagnostic assays for detecting diseases associated with inappropriate 
DPRP activity or expression levels. Antibodies that specifically bind DPRP may be used 
for the diagnosis of disorders characterized by expression of DPRP, or in assays to 
monitor patients being treated with DPRP or with agonists or antagonists (inhibitors) of
DPRP. Antibodies useful for diagnostic purposes may be prepared in the same manner 
as those described above for therapeutics. Diagnostic assays for DPRP include methods 
that utilize the antibody and a label to detect DPRP in human body fluids or in extracts 
of cells or tissues. The antibodies may be used with or without modification, and they 
may be labeled by covalent or non-covalent joining with a reporter molecule. A wide
variety of reporter molecules are known in the art. Recombinant DPRP proteins that
have been modified so as to be catalytically inactive can also be used as dominant 
negative inhibitors. Such modifications include, for example, mutation of the active site. 
A variety of protocols for measuring DPRP, including ELISAs, RIAs and FACS, 
are known in the art and provide a basis for diagnosing altered or abnormal levels of
DPRP expression. Normal or standard values for DPRP expression are established by 
combining body fluids or cell extracts taken from normal mammalian subjects, 
preferably human, with antibody to DPRP under conditions suitable for complex 
formation. The method for detecting DPRP in a biological sample would comprise the
steps of: a) providing a biological sample; b) combining the biological sample and an
anti-DPRP antibody under conditions which are suitable for complex formation to occur 
between DPRP and the antibody; and c) detecting complex formation between DPRP 
and the antibody, thereby establishing the presence of DPRP in the biological sample. 
The amount of complex formation then may be quantified by various methods,
preferably by photometric means. Quantities of DPRP expressed in subject, control, and
disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 
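
The comparison of subject values with standard values may be sketched as follows. This is an illustration under stated assumptions only: the use of the mean plus or minus two standard deviations as the standard range, the function names, and all photometric readings are hypothetical and are not prescribed by the method described above.

```python
from statistics import mean, stdev


def standard_range(normal_values, n_sd=2.0):
    """Return (low, high) bounds derived from readings of normal subjects.

    The mean +/- n_sd standard deviations is an assumed convention here.
    """
    centre = mean(normal_values)
    spread = stdev(normal_values)
    return centre - n_sd * spread, centre + n_sd * spread


def classify(subject_value, normal_values, n_sd=2.0):
    """Report whether a subject reading deviates from the standard range."""
    low, high = standard_range(normal_values, n_sd)
    if subject_value < low:
        return "below standard range"
    if subject_value > high:
        return "above standard range"
    return "within standard range"


# Hypothetical absorbance readings from pooled normal samples.
normals = [0.40, 0.42, 0.38, 0.41, 0.39]
result = classify(0.90, normals)
```

A reading classified as above or below the standard range would correspond to the excess or absence of DPRP expression that the assay is intended to distinguish.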

In another embodiment of the invention, the polynucleotides encoding DPRP are 
used for diagnostic purposes, which polynucleotides may include oligonucleotide
sequences, complementary RNA and DNA molecules, and PNAs. These
polynucleotides may be used to detect and quantitate gene expression in biopsied tissues
in which expression of DPRP may be correlated with one of the Diseases. The 
diagnostic assay may be used to distinguish between absence, presence, and excess 
expression of DPRP and to monitor regulation of DPRP levels during therapeutic
intervention. Moreover, pharmacogenomic single nucleotide polymorphism (SNP)
analysis of the DPRP genes can be used as a method to screen for mutations that indicate 
predisposition to disease or modified response to drugs. 

DPRP polynucleotide and polypeptide sequences, fragments thereof, antibodies
of DPRPs, and agonists, antagonists or inhibitors of DPRPs can be used as discovery
tools to identify molecular recognition events and therefore proteins, polypeptides and
peptides that interact with DPRP proteins. A specific example is phage display peptide
libraries where greater than 10^8 peptide sequences can be screened in a single round of
panning. Such methods as well as others are known within the art and can be utilized to
identify compounds that inhibit or enhance DPRP-1, DPRP-2 or DPRP-3 activity.
Coupled links represent functional interactions such as complexes or pathways, and
proteins that interact with DPRPs can be identified by a yeast two-hybrid system,
proteomics (differential 2D gel analysis and mass spectrometry) and genomics
(differential gene expression by microarray or serial analysis of gene expression, SAGE).
Proteins identified as functionally linked to DPRPs and the process of interaction form
the basis of methods of screening for inhibitors, agonists, antagonists and modulators
of these DPRP-protein interactions.

The term "antagonist," as it is used herein, refers to an inhibitor molecule which,
when bound to DPRP, decreases the amount or the duration of the effect of the
biological or immunological activity of DPRP, e.g. decreasing the enzymatic activity of
the peptidase to cleave the N-terminal dipeptide. Antagonists may include proteins,
nucleic acids, carbohydrates, antibodies, or any other molecules which decrease the
effect of DPRP; for example, they may include small molecules and organic compounds
that bind to and inactivate DPRPs by a competitive or non-competitive type mechanism.
Specific examples of DPRP tetrapeptide peptidic enzyme activity inhibitors are
described in Examples 6 and 7. Inhibitors can be, for example, inhibitors of the DPRP
protease activity, or alternatively inhibitors of the binding activity of the DPRP to
proteins with which they interact. Specific examples of such inhibitors can include, for
example, anti-DPRP antibodies, peptides, protein fragments, or small peptidyl protease
inhibitors, or small non-peptide, organic molecule inhibitors which are formulated in a
medium that allows introduction into the desired cell type. Alternatively, such inhibitors
can be attached to targeting ligands for introduction by cell-mediated endocytosis and
other receptor mediated events. Such methods are described further below and can be
practiced by those skilled in the art given the DPRP nucleotide and amino acid
sequences described herein.

A further use for DPRPs is for the screening of potential antagonists for use as
therapeutic agents, for example, for inhibiting binding to DPRP, as well as for screening
for agonists. DPRP, its immunogenic fragments, or oligopeptides thereof can be used 
for screening libraries of compounds which are prospective agonists or antagonists in 
any of a variety of drug screening techniques. The fragment employed in such screening
may be free in solution, affixed to a solid support, borne on a cell surface, or located
intracellularly. The formation of binding complexes between DPRP and the agent being 
tested is then measured. Other assays to discover antagonists that will inhibit DPRP are 
apparent from the disclosures of U.S. Patents Nos. 6,011,155, 6,107,317, 6,110,949,
6,124,305 and 6,166,063, which describe inhibitors of DPPIV. Another worthwhile use
of these DPRPs is the screening of inhibitors of DPPIV to show that they will not have
undesired side effects by also inhibiting one or more of the DPRPs.

A method provided for screening a library of small molecules to identify a 
molecule which binds DPRP generally comprises: a) providing a library of small 
molecules; b) combining the library of small molecules with the polypeptide of either
SEQ ID NOS: 1, 3 or 5, or with a fragment thereof, under conditions which are suitable
for complex formation; and c) detecting complex formation, wherein the presence of
such a complex identifies a small molecule which binds DPRP.

One method for identifying an antagonist comprises delivering a small molecule
which binds DPRP into extracts from cells transformed with a vector expressing DPRP
along with a chromogenic substrate (e.g. Ala-Pro-AFC or Ala-Pro-AMC) under 
conditions where cleavage would normally occur, and then assaying for inhibition of 
cleavage by the enzyme by monitoring changes in fluorescence, or UV light absorption, 
by spectrophotometry to identify molecules that inhibit cleavage. A reduced rate of
reaction or total amount of fluorescence or UV light absorption, in the presence of the
molecule, establishes that the small molecule is an antagonist which reduces DPRP 
catalytic/enzymatic activity. Once such molecules are identified, they may be 
administered to reduce or inhibit cleaving by a DPRP. 
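
The comparison of reaction rates in the presence and absence of a test molecule may be sketched as follows. This is a minimal illustration only: the least-squares slope is one assumed way of estimating a reaction rate from a fluorescence time course, and the function names and all readings are hypothetical.

```python
def reaction_rate(times_min, fluorescence):
    """Least-squares slope of fluorescence against time (units per minute)."""
    n = len(times_min)
    if n < 2 or n != len(fluorescence):
        raise ValueError("need at least two paired readings")
    t_mean = sum(times_min) / n
    f_mean = sum(fluorescence) / n
    sxx = sum((t - t_mean) ** 2 for t in times_min)
    sxy = sum((t - t_mean) * (f - f_mean)
              for t, f in zip(times_min, fluorescence))
    return sxy / sxx


def percent_inhibition(rate_with_molecule, rate_without_molecule):
    """Percentage reduction in rate caused by the test molecule."""
    return 100.0 * (1.0 - rate_with_molecule / rate_without_molecule)


# Hypothetical fluorescence readings (arbitrary units) at 0, 10, 20, 30 min.
times = [0, 10, 20, 30]
control = [100, 300, 500, 700]   # extract plus substrate only
treated = [100, 150, 200, 250]   # extract plus substrate plus test molecule

inhibition = percent_inhibition(
    reaction_rate(times, treated),
    reaction_rate(times, control),
)
```

A positive value indicates a reduced rate of cleavage in the presence of the molecule, which is the criterion stated above for an antagonist; the cut-off above which a molecule is treated as a hit is left to the practitioner.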

The term "agonist," as used herein, refers to a molecule which, when bound to
DPRP, increases or prolongs the duration of the effect of DPRP. Agonists may include
proteins, nucleic acids, carbohydrates, or any other molecules that bind to and modulate 
the effect of DPRP. Although it is less likely that small molecules will prove to be 
effective DPRP agonists, a method for identifying such a small molecule, which binds 
DPRP as an agonist, comprises delivering a chromogenic form of a small molecule that
binds DPRP into cells transformed with a vector expressing DPRP and assaying for
fluorescence or UV light absorption changes by spectrophotometry. An increased 
amount of UV absorption or fluorescence would establish that the small molecule is an 
agonist that increases DPRP activity. 

Another technique for drug screening which may be used provides for high
throughput screening of compounds having suitable binding affinity to the protein of
interest as described in published PCT application WO84/03564. In this method, large 
numbers of different small test compounds are synthesized on a solid substrate, such as 
plastic pins or some other surface. The test compounds are reacted with DPRP, or with 
fragments thereof, and then washed. Bound DPRP is then detected by methods well
known in the art. Purified DPRP can also be coated directly onto plates for use in the
aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies
can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding DPRP specifically compete with a test 
compound for binding DPRP. In this manner, antibodies can be used to detect the
presence of any peptide that shares one or more antigenic determinants with DPRP. 

As indicated above, by investigating the binding sites, ligands may be designed 
that, for example, have more interactions with DPRP than do its natural ligands. Such 
antagonist ligands will bind to DPRP with higher affinity and so function as competitive
ligands. Alternatively, synthetic or recombinant proteins homologous or analogous to
the ligand binding site of native DPRP may be designed, as may other molecules having 
high affinity for DPRP. Such molecules should also be capable of displacing DPRP and 
provide a protective effect. 

As indicated above, the knowledge of the structures of DPRP enables synthetic
binding site homologues and analogues to be designed. Such molecules will facilitate
greatly the use of the binding properties to target potential therapeutic agents, and they 
may also be used to screen potential therapeutic agents. Furthermore, they may be used 
as immunogens in the production of monoclonal antibodies, which antibodies may 
themselves be used in diagnosis and/or therapy as described hereinbefore. 

Given the ubiquitous expression of several members of the prolyl oligopeptidase
S9B family, cell lines in which targeted gene disruption of DPPIV, DPRP-1, DPRP-2,
DPRP-3, FAP and DPP VI establishes the null phenotype will be of great value to assist
screening for selective and potent compounds. Accordingly, the invention provides such
cell lines engineered with Lox-Neo IRES tk cassette and GFP-IRES-Neo Knock-in/out
cassette DNA elements for constructing somatic gene targeting vectors.

Example 1 

Cloning and Expression of DPRP genes Using the Mammalian Expression System 
DNA fragments encoding the full-length polypeptide DPRP-1 were amplified
using PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the gene,
i.e. SEQ ID NO:45 and NO:46. In addition, DNA fragments encoding the full-length
polypeptide DPRP-2 were amplified using PCR oligonucleotide primers corresponding
to the 5' and 3' sequences of that gene, i.e. SEQ ID NO:50 and NO:51. Furthermore,
DNA fragments encoding the full-length polypeptide DPRP-3 were amplified using PCR
oligonucleotide primers corresponding to the 5' and 3' sequences of that gene, i.e. SEQ
ID NO:55 and NO:56.
The three amplified sequences were respectively isolated from a 0.7% agarose
gel using a commercially available kit (GFX PCR DNA and Gel Band Purification Kit,
Amersham Pharmacia Biotech Inc., Piscataway NJ, USA). The fragments were then
ligated into the cloning vector pGEM-7Zf(-) (Promega Corporation, Madison WI, USA)
and sequenced. The corresponding cloning constructs were respectively designated
pGEM7-DPRP1, pGEM7-DPRP2 and pGEM7-DPRP3. The DNA sequences encoding
the truncated DPRP-1, DPRP-2 or DPRP-3 were amplified using pGEM7-DPRP1,
pGEM7-DPRP2 or pGEM7-DPRP3 as a template and PCR oligonucleotide primers.
SEQ ID NO:45 and NO:47 were used for DPRP-1; SEQ ID NO:50 and NO:52 were used
for DPRP-2; and SEQ ID NO:57 and NO:58 for DPRP-3. The amplified sequences were
again isolated from a 0.7% agarose gel using the same purification kit and sub-cloned
into pGEM-7Zf(-). The resulting constructs were designated pGEM7-DPRP1f,
pGEM7-DPRP2f and pGEM7-DPRP3f.

To make the DPRP-1 mammalian expression construct, pGEM7-DPRP1 was
digested with the restriction enzymes KpnI and NotI to release the full length DPRP-1
gene. The DNA fragment carrying the DPRP-1 gene was gel band purified using the
above kit and then inserted into expression vector pcDNA3 (Invitrogen, Carlsbad CA,
USA) to make the native DPRP-1 expression construct, which was designated
pcDNA-DPRP1. pGEM7-DPRP1f was digested with the restriction enzymes XbaI and
HindIII to release the truncated DPRP-1f gene. The DNA fragment carrying the DPRP-1f
gene was gel band purified using the above kit and then inserted into expression vector
pcDNA3.1(-)/myc-His A (Invitrogen, Carlsbad CA, USA) to make the tagged DPRP-1
expression construct pcDNA-MycHis-DPRP1.

To make the DPRP-2 mammalian expression construct, pGEM7-DPRP2 was
digested with the restriction enzymes HindIII and BamHI to release the full length
DPRP-2 gene. The DNA fragment carrying the DPRP-2 gene was gel band purified 
using the above kit and then inserted into expression vector pcDNA3 (Invitrogen, 
Carlsbad CA, USA) to make the native DPRP-2 expression construct, which was 
designated pcDNA-DPRP2. pGEM7-DPRP2f was digested with the restriction enzymes
EcoRI and BamHI to release the truncated DPRP-2f gene. The DNA fragment carrying
the DPRP-2f gene was gel band purified using the above kit and then inserted into 
expression vector pcDNA3.1(-)/myc-His B (Invitrogen, Carlsbad CA, USA) to make the 
tagged DPRP-2 expression construct designated pcDNA-MycHis-DPRP2. 

To make the DPRP-3 mammalian expression construct, pGEM7-DPRP3 was
digested with the restriction enzymes EcoRI and XhoI to release the full length DPRP-3
gene. The DNA fragment carrying the DPRP-3 gene was gel band purified using the
above kit and then inserted into expression vector pcDNA3 (Invitrogen, Carlsbad CA,
USA) to make the native DPRP-3 expression construct designated pcDNA-DPRP3.
pGEM7-DPRP3f was digested with the restriction enzymes NheI and ApaI to release the
truncated DPRP-3f gene. The DNA fragment carrying the DPRP-3f gene was gel band
purified using the above kit and then inserted into expression vector
pcDNA3.1(-)/myc-His B (Invitrogen, Carlsbad CA, USA) to make the tagged DPRP-3
expression construct pcDNA-MycHis-DPRP3.
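
For convenience, the six expression constructs of this Example can be collected into a single record structure, as in the following sketch. The primer pairs, restriction enzymes and vectors are those recited above, with enzyme names in their standard spelling; the class name, field names and helper function are hypothetical and serve only to organize the information.

```python
from typing import List, NamedTuple, Tuple


class Construct(NamedTuple):
    name: str                  # designation of the expression construct
    gene: str                  # DPRP gene carried by the construct
    primers: Tuple[int, int]   # SEQ ID NOs of the PCR primer pair
    enzymes: Tuple[str, str]   # restriction enzymes releasing the insert
    vector: str                # expression vector receiving the insert


CONSTRUCTS: List[Construct] = [
    Construct("pcDNA-DPRP1", "DPRP-1", (45, 46),
              ("KpnI", "NotI"), "pcDNA3"),
    Construct("pcDNA-MycHis-DPRP1", "DPRP-1", (45, 47),
              ("XbaI", "HindIII"), "pcDNA3.1(-)/myc-His A"),
    Construct("pcDNA-DPRP2", "DPRP-2", (50, 51),
              ("HindIII", "BamHI"), "pcDNA3"),
    Construct("pcDNA-MycHis-DPRP2", "DPRP-2", (50, 52),
              ("EcoRI", "BamHI"), "pcDNA3.1(-)/myc-His B"),
    Construct("pcDNA-DPRP3", "DPRP-3", (55, 56),
              ("EcoRI", "XhoI"), "pcDNA3"),
    Construct("pcDNA-MycHis-DPRP3", "DPRP-3", (57, 58),
              ("NheI", "ApaI"), "pcDNA3.1(-)/myc-His B"),
]


def constructs_in_vector(vector: str) -> List[str]:
    """Names of all constructs built in the given expression vector."""
    return [c.name for c in CONSTRUCTS if c.vector == vector]
```

For example, `constructs_in_vector("pcDNA3")` lists the three native (untagged) constructs.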

Example 2 

Expression Pattern of DPRP genes in human tissues

Quantitative PCR analysis was carried out to examine the levels of expression of 
the mRNAs for the polypeptides of the present invention in human tissues. RT-PCR was
also carried out on a number of human cell lines including but not limited to prostate
cancer cells (LNCaP, PC3, DU145), the MLTC-1 line (mouse testis), and MDA-MB-231
cells (breast cancer). Bands of the expected sizes for DPRP-1, DPRP-2 and DPPIV were
all detected in the various cancer cell lines, with FAP also being expressed at very low
levels.

Northern Blot Analysis 

Northern blot analysis was performed with 2 µg poly(A)+ RNA isolated from
eight different tissues using DPRP probes. Specifically, a human Multiple Tissue
Northern (MTN) blot (Clontech, Palo Alto, Calif.) was probed with a 1 kb N-terminal
fragment that had been radioactively labeled by random priming in the presence of
[α-32P]dCTP (A. P. Feinberg et al., Anal. Biochem. 132, 6 (1983)). Hybridization was
performed at 68°C overnight in ExpressHyb™ hybridization solution (Clontech, Palo
Alto, Calif.). The blots were first washed at room temperature in 2x SSC and
0.05% SDS, and then washed at 60°C (DPRP-1 and DPRP-2) or 50°C (DPRP-3) in 0.1x
SSC and 0.1% SDS.

Northern analysis showed expression of DPRP-1 in several tissues with the most
abundant signal being in testis, prostate, muscle and brain. Testis showed 3 transcripts
approximately 7.5, 4.5 and 2.5 kb in length. The shortest mRNA species was very
abundant in testis but negligible in the other tissues tested. DPRP-2 was ubiquitously
expressed in every tissue with highest levels in liver and muscle and a predominant
transcript at 5 kb. DPRP-3 expression was limited to brain and pancreas. Further
analysis was conducted for the three proteases in specific brain regions (cerebellum,
cortex, medulla, spinal cord, occipital lobe, frontal lobe, temporal lobe and putamen).
DPRP-1 was expressed in all regions with low levels present in the spinal cord, while
DPRP-2 was expressed in all brain regions tested.

Oligonucleotide primers SEQ ID NO:48 and NO:49 were used for DPRP-1
quantitative PCR, whereas oligonucleotide primers SEQ ID NO:53 and NO:54 were
used for DPRP-2 quantitative PCR. Human Multiple Tissue cDNA (MTC™) Panel I
and Panel II (Clontech, Palo Alto CA, USA) were used as normalized cDNA templates.
0.5 ng of each cDNA was used in a 25 µl PCR reaction, with each primer at a final
concentration of 300 nM. The PCR reaction was performed using a SYBR Green PCR
Core Reagents Kit (Applied Biosystems, Foster City CA, USA) and detected with an
Applied Biosystems GeneAmp 5700 sequence detection system. The manufacturer's
recommended thermal cycling parameters, e.g. 50°C for 2 min, 95°C for 10 min followed
by 40 cycles of 95°C for 15 sec and 60°C for 1 min, were used. The data obtained show
relatively high levels of expression for both DPRP-1 and DPRP-2 in the pancreas, ovary
and testis, and a particularly high level for DPRP-2 in the liver.
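
The reaction set-up arithmetic implied by the above conditions may be sketched as follows. The reaction volume, primer concentration and cycling steps are taken from the text; the 10 µM primer stock concentration and all function and variable names are assumptions made only for this illustration, and the computed time excludes temperature ramping.

```python
REACTION_VOLUME_UL = 25.0   # PCR reaction volume stated in the text
PRIMER_FINAL_NM = 300.0     # final concentration of each primer
ASSUMED_STOCK_UM = 10.0     # hypothetical primer stock, not from the text


def primer_pmol(final_nm, volume_ul):
    """Amount of one primer per reaction (nM x ul gives fmol; /1000 = pmol)."""
    return final_nm * volume_ul / 1000.0


def stock_volume_ul(final_nm, volume_ul, stock_um):
    """Volume of primer stock to add, from C1 * V1 = C2 * V2."""
    return final_nm * volume_ul / (stock_um * 1000.0)


# Thermal cycling programme as (temperature in deg C, duration in seconds).
INITIAL_HOLDS = [(50, 120), (95, 600)]
CYCLE_STEPS = [(95, 15), (60, 60)]
N_CYCLES = 40


def programmed_minutes(holds, steps, n_cycles):
    """Total programmed hold time in minutes, ignoring ramp times."""
    hold_seconds = sum(seconds for _, seconds in holds)
    cycle_seconds = sum(seconds for _, seconds in steps)
    return (hold_seconds + n_cycles * cycle_seconds) / 60.0


pmol_per_primer = primer_pmol(PRIMER_FINAL_NM, REACTION_VOLUME_UL)
stock_ul = stock_volume_ul(PRIMER_FINAL_NM, REACTION_VOLUME_UL, ASSUMED_STOCK_UM)
run_minutes = programmed_minutes(INITIAL_HOLDS, CYCLE_STEPS, N_CYCLES)
```

Under these assumptions each reaction contains 7.5 pmol of each primer, and the programmed holds total 62 minutes.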

Example 3 - Production of DPRP Polyclonal Antibodies and Western Blotting
The amino acid sequence deduced from the cDNA encoding DPRP-1 was 
analyzed using DNASTAR software (DNASTAR, Inc.) to determine regions of high 
immunogenicity, and a corresponding oligopeptide was synthesized and used to raise 
anti-DPRP-1 antibodies. The procedure was repeated for DPRP-2 and DPRP-3. The
selection of appropriate peptide sequences and the techniques for antibody production
are methods well known to those of skill in the art. Selection of appropriate epitopes, 
such as those near the C-terminus or in hydrophilic regions, is well known in this art. 

Typically, oligopeptides that are about 15 to 20 residues in length, e.g. SEQ ID
NO:59 for DPRP-1, SEQ ID NO:60 for DPRP-2 and SEQ ID NO:61 for DPRP-3, were
synthesized using an Applied Biosystems Peptide Synthesizer Model 431A.
Fmoc-chemistry was used and the 19- or 15-residue peptides were respectively coupled
to keyhole limpet hemocyanin (KLH, Sigma, St Louis, Mo.) by reaction with
N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS). Rabbits were immunized
with the oligopeptide-KLH complex in complete Freund's adjuvant. The resulting
antisera were tested for antipeptide activity, e.g., by binding the peptide to plastic,
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with
radioiodinated goat anti-rabbit IgG.

Western blotting was performed using normal human protein samples (Protein
Medley) obtained from Clontech (about 36 µg of total proteins). Proteins were
fractionated through 10% SDS-polyacrylamide gels, and transferred to 0.45 µm
nitrocellulose membranes. Membranes were blocked in Tris-buffered saline (TBS) with
0.05% Tween 20 and 1% BSA. Anti-DPRP-1 or anti-DPRP-2 specific antibodies were used
as primary antibodies and were diluted 1:5,000 in Tris-buffered saline with 0.05%
Tween 20 (TBST), and the Alkaline Phosphatase (AP) conjugated goat anti-rabbit IgG
(Promega) was diluted 1:5,000 in the same buffer before use. The positive reaction was
visualized by incubating the membrane in Western Blue Stabilized Substrate (Promega)
for AP until the bands of interest had reached the desired intensity. DPRP-1 and
DPRP-2 proteins were detected in brain, muscles, kidney, prostate, testis and ovary
tissues. DPRP-1 and DPRP-2 were synthesized as approximately 101 kDa and 100 kDa
forms, respectively, which are in good agreement with the molecular masses estimated
from their primary structure as shown in Table 3.



Table 3. Predicted Molecular Weight, Number of potential N-linked glycosylation sites
(Asn residues) and predicted pI values of DPRP-1, DPRP-2 and DPRP-3, based on
sequence analysis using the method developed by Hopp and Woods, Proc. Nat. Acad.
Sci. 78:3824-3828 (1981).
Several additional bands of similar molecular weight were observed. These are
thought to be due to the presence of post-translational glycosylation of the proteins.
Table 3 also shows the number of potential N-glycosylation sites for the DPRP proteins.
The presence of glycosylated and unglycosylated forms of the proteins was evaluated
using tunicamycin, an inhibitor of oligosaccharide synthesis. It is evident that the
smaller forms were unglycosylated forms. The correlation between mRNA (Northern
analysis) and protein quantity (Western analysis) for DPRP-1 is shown in Table 4.



Table 4. Correlation of mRNA and protein expression of DPRP-1 in human tissues 
Example 4

Immunohistochemical localization of DPRP proteins in human tissues 
Four-micron sections were prepared from a number of different formalin-fixed,
paraffin-embedded human tissues. Tissue sections were deparaffinized through 4
immersions in xylenes for 5 minutes, followed by a graded alcohol series to distilled
water. Steam heat induced epitope recovery (SHIER) was used with several different
SHIER solutions, with and without enzyme digestion of the tissue at two different
concentrations (Ladner et al., Cancer Res. 60: 3493-3503 (2000)). The treatments and
antibody dilutions employed are outlined below.

1. Blocking Reagent for 15 minutes (Normal Goat Serum)
2. Primary Antibody for 25 min, 60 min or overnight incubation
3. Secondary Antibody for 25 minutes (Biotinylated Goat-anti-rabbit IgG)
4. Endogenous Peroxidase Blocking for 3 x 1.5 minutes
5. ABC (avidin-biotin complex) / Horse Radish Peroxidase for 25 minutes
6. DAB Chromogen for 3 x 5 minutes (Brown reaction product)
7. Light Hematoxylin Counter Stain for 1 minute

Positive controls were run to assure the detection chemistries and antigen
pretreatments were working appropriately. Rabbit IgG was run as a negative control.
An avidin-biotin based tissue staining system was used for the detection of the DPRP-1
antibody. Horseradish peroxidase was used as a reporter enzyme with DAB as chromogen.
After staining, slides were dehydrated through an alcohol series to absolute ethanol
followed by xylene rinses. Slides were permanently coverslipped with glass coverslips
and Permount. Digital images of representative staining, where positive staining was
indicated by a dark brown chromogen (DAB-HRP reaction product), were captured
using a video camera from Olympus. Hematoxylin counterstain provides a blue nuclear
stain to assess cell and tissue morphology.

DPRP-1 rabbit polyclonal antibody labels formalin-fixed, paraffin-embedded 
human tissues, including normal testis, prostate glands, endometrial glands, tonsils and 
pancreas. It was also present in endothelial cells of normal ovary, bladder and kidney.
Staining was localized in the cytoplasm in epithelial and some stromal cells such as
fibroblasts, endothelial cells and lymphocytes. Interestingly in normal testis tested with 
DPRP-1 antibodies, there was distinctive expression in Leydig cells and multinucleated 
macrophages found in interstitial tissue, which is the space surrounding the seminiferous 
tubules. Tonsil B cells were stained with DPRP-1 antibody. 
Example 5

Mammalian and Insect Cell Expression of DPRP Proteins and Purification 
Plasmid DNA of pcDNA-DPRP1, pcDNA-MycHis-DPRP1, pcDNA-DPRP2 or
pcDNA-MycHis-DPRP2 was transfected into PEAK (EdgeBioSystems, Gaithersburg
MD, USA) or COS-1 (ATCC CRL-1650) cells using the LipofectAmine (Life Technologies,
Gaithersburg MD, USA) method recommended by the manufacturer. Transfected cells
were maintained in DMEM with 5% FBS at 37°C with 5% CO2 for 48 hours. Cells were
then collected, 48 hours after transfection, for recombinant protein extraction; they
were homogenized and then spun at 18,000 x g for 40 min. The
supernatants were collected as cytosolic fractions. This fraction was loaded on a TALON
spin column (Clontech), and His-tagged proteins were eluted with 50 mM PBS, 150 mM
imidazole, pH 7. Recombinant proteins were then detected by western blotting with
anti-myc antibody and visualized using a ProtoBlot II AP system (Promega).
Recombinant affinity purified fusions of DPRP-1 and DPRP-2 were detected by
western blot, and DPRP-1 and DPRP-2 were synthesized as 112 kDa and 109 kDa forms
as predicted.

Naturally occurring or recombinant DPRP proteins were substantially purified by
immunoaffinity chromatography using antibodies specific for DPRP-1, DPRP-2 or
DPRP-3. An immunoaffinity column was constructed by covalently coupling DPRP
antibodies to an activated chromatographic resin, such as CNBr-activated Sepharose
(Pharmacia & Upjohn). After the coupling, the resin was blocked and washed according
to the manufacturer's instructions.

Media or cell extracts containing DPRP proteins were passed over the
immunoaffinity column, and the column was washed under conditions that allow the
preferential absorbance of DPRPs (e.g., high ionic strength buffers in the presence of
detergent). The column was eluted under conditions that disrupt antibody/DPRP binding
(e.g., a buffer of pH 2-3 or a high concentration of a chaotrope, such as urea or
thiocyanate ion), and purified DPRP was collected.

Example 6

Enzymatic Activity of DPRP Proteins and Methods of Screening for Inhibitors

The kinetic properties of recombinant DPRP-1 and DPRP-2 were determined in a
continuous fluorimetric assay. Buffer, pH and temperature dependence optimization led
to the following assay conditions: enzyme assays were performed in 50 mM PBS, pH 7.4;
50 µl (50 µg/ml) of purified enzyme was mixed with 1 µl of Ala-Pro-AMC (Enzyme Systems)
at different concentrations. Plates were then incubated at 37°C for 30 min, and
fluorescence was detected using a Wallac 1420 Fluorimeter with λex40355 and λem535.
The Km values of DPRP-1 and DPRP-2 were similar (208 and 161 µM, respectively).
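
As an illustration of how a Michaelis constant can be estimated from rates measured at several substrate concentrations, the following minimal Python sketch uses a Hanes-Woolf linearization. The fitting method, the function names and the synthetic data are assumptions made for illustration only; the source does not state how the kinetic constants were calculated, and the numbers below are not measured values.

```python
# Minimal sketch: estimate Vmax and Km by Hanes-Woolf linear regression.
# Hanes-Woolf form: [S]/v = (1/Vmax) * [S] + Km/Vmax
# All data below are synthetic (hypothetical), not taken from the examples.

def michaelis_menten(s, vmax, km):
    """Initial rate predicted by the Michaelis-Menten equation."""
    return vmax * s / (km + s)

def fit_hanes_woolf(substrate, rates):
    """Return (vmax, km) from paired substrate concentrations and rates."""
    n = len(substrate)
    xs = list(substrate)
    ys = [s / v for s, v in zip(substrate, rates)]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                     # equals 1/Vmax
    intercept = mean_y - slope * mean_x   # equals Km/Vmax
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km

# Hypothetical substrate concentrations (µM) and noise-free rates generated
# with an assumed Km of 200 µM and Vmax of 1000 fluorescence units/min.
concs = [25.0, 50.0, 100.0, 200.0, 400.0, 800.0]
rates = [michaelis_menten(s, 1000.0, 200.0) for s in concs]

vmax_est, km_est = fit_hanes_woolf(concs, rates)
print(f"Vmax = {vmax_est:.1f}, Km = {km_est:.1f} µM")
```

With real, noisy data a nonlinear least-squares fit of the Michaelis-Menten equation is generally preferred over a linearization; the linear form is used here only to keep the sketch free of external dependencies.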

Further biochemical characterization reveals that DPRP-1 and DPRP-2 have
similar profiles to DPPIV. The two purified proteases and DPPIV were preincubated
with inhibitors at room temperature for 30 min. Substrate, Ala-Pro-AMC (100 µM), was
then added, and the fluorescence intensity was recorded as 60 readings during a 60 min
period. The irreversible serine protease inhibitor AEBSF was the only inhibitor tested
that showed strong inhibition of all three enzymes (Table 5). This confirms the structural
and domain analysis prediction that these proteins belong to the serine protease
superfamily.

Table 5. Inhibition of DPRP-1 and DPRP-2 by Protease Inhibitors

                                                           Residual activity (% of control)
Inhibitor            Inhibitor Property     Concentration  DPRP-1    DPRP-2    DPPIV
AEBSF                serine, irreversible   5 mM           29.6      23.9      21.1
Aprotinin            serine, reversible     5 µg/ml        77.5      63.2      80.2
Pepstatin            aspartic, reversible   2 µg/ml        97.3      95.0      93.5
DTT                  cysteine               2 mM           100.1     94.8      98.3
β-Mercaptoethanol    cysteine               100 mM         93.2      84.0      98.0
EDTA                 metallo, reversible    2 mM           91.5      86.0      93.5
Leupeptin            serine, reversible     50 µg/ml       91.1      90.4      90.7

In addition to Ala-Pro-AMC, additional substrates tested also confirmed that
DPRP-1 and DPRP-2 are dipeptidyl peptidases. The data were derived by determining
the fluorescence change following a 30-minute incubation of the substrates (125 µM)
with enzymes, as a percentage of the fluorescence measured at time zero. Ala-Pro-AMC and
Gly-Pro-AMC were the only good substrates among those tested.

Table 6. DPRP-1 and DPRP-2 are dipeptidyl peptidases.

                     % Change in Fluorescence at 30 minutes
Substrate            DPRP-1    DPRP-2    DPPIV
Ala-Pro-AMC          239.0     127.5     379.0
Gly-Pro-AMC          341.5     205.0     444.0
Ala-Pro-pNA          45.5      44.0      29.5
Pro-pNA              -1        -2.5      0.0
Gly-Arg-pNA          -4.5      -0.5      0.0
Lys-Ala-pNA          2.5       0.5       0.5
Ala-Phe-Pro-pNA      -4        -0.5      2.0

Additional natural and non-natural amino acid di-, tri- and tetra-peptides were tested in
order to find an optimal substrate for testing each of the DPRP proteins that will also
show reduced activity when incubated with DPPIV.

The enzyme assay method described here is one of a number of methods that can
be utilized to screen for peptide and non-peptide inhibitors of the DPRP enzymes.

Libraries of tetrapeptide inhibitors were tested to discover inhibitors of enzyme activity.
Candidate inhibitors were prepared as 10-20 mM stock solutions in DMSO and stored at
-20°C. Dilutions were made in assay buffer. Inhibition was determined by comparing
the change in fluorescence of the inhibited enzyme to the change in fluorescence of the
control (vehicle) enzyme. Percent inhibition was calculated as
100 - (fluorescence units of sample / fluorescence units of control x 100).
The percent inhibition and the inhibitor concentration at which
the enzyme was 50% inhibited (IC50) were ascertained by plotting percent inhibition vs.
inhibitor concentration on the log scale. As shown in Figure 3, several tetrapeptide
amides inhibited enzyme activity, wherein data are expressed as the % of activity in the
presence of vehicle (0.02% DMSO) alone. Compounds were added at 1 mM. Most
interesting was the apparent differential activity of some tetrapeptides for DPRP-1 and
DPRP-2, compared to DPPIV. While all three enzymes were inhibited by Peptide-1,
only DPRP-1 and DPRP-2 were significantly inhibited by Peptide-4 and Peptide-5. This
demonstrates that selective inhibition of the purified enzymes is achievable.
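
The percent-inhibition calculation and the reading of an IC50 from a plot against log concentration can be sketched in Python as follows. This is a minimal illustration only: the function names, the interpolation between the two bracketing points, and the fluorescence readings are hypothetical, and the source does not specify how the curve was actually read.

```python
import math

# Minimal sketch: percent inhibition and an IC50 estimated by linear
# interpolation on a log10 concentration axis. Readings are hypothetical.

def percent_inhibition(sample_fl, control_fl):
    """100 - (fl units of sample / fl units of control x 100)."""
    return 100.0 - (sample_fl / control_fl * 100.0)

def ic50_by_interpolation(concs, inhibitions):
    """Interpolate the concentration giving 50% inhibition.

    Returns None when the data never bracket 50% inhibition.
    """
    points = sorted(zip(concs, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi > i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None

# Hypothetical change in fluorescence for the vehicle control and for an
# inhibitor tested at four concentrations (µM).
control_fl = 1000.0
concs_um = [1.0, 10.0, 100.0, 1000.0]
sample_fl = [900.0, 750.0, 250.0, 100.0]

inhibitions = [percent_inhibition(fl, control_fl) for fl in sample_fl]
ic50_um = ic50_by_interpolation(concs_um, inhibitions)
print(inhibitions, ic50_um)
```

Interpolating between the two points that bracket 50% mirrors reading the value off a semi-log plot; fitting a full dose-response curve would be the more rigorous alternative when enough concentrations are tested.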

The assay described in this example can also be used to screen additional
synthetic or naturally occurring compound libraries, including macromolecules, for
agents that either inhibit or enhance DPRP activity. The DPRP-1 and DPRP-2
polypeptides to be used in the assay can be obtained by, for example, in vitro translation,
recombinant expression (see Example 5) or biochemical procedures. Methods other than
those described here can also be used to screen and identify compounds that inhibit
DPRP-1, DPRP-2 or DPRP-3, which methods can include, for example, binding assays
such as ELISAs and RIAs.

Example 7 

Effect of DPRP Inhibitors on the Proliferation of Human Cancer Cells In Vitro

To assess the effect that several inhibitors of DPRP-1 and DPRP-2
activity may have on the proliferation of human cancer cells, LNCaP, PC3 and DU145 cells,
the mouse testis line MLTC-1 and MDA-MB-231 breast cancer cells were plated (10^4 per
well) in 96-well tissue culture plates and allowed to grow and attach for 24 hours at 37°C
in a CO2 incubator. Compounds at various dilutions (final dilutions: 0.1 nM - 10 µM)
were then added to the wells for various incubation periods from 24 hours to 96 hours,
with fresh compound being added each day. Addition of the diluent DMSO alone
served as the control. Following incubation with these compounds in triplicate,
proliferation of the cells was determined using an XTT cell proliferation assay (Roche 1-
465-015). The plates were read at 490 and 650 nm 5 hours after the XTT mix was added.
An increase in cell proliferation was observed with three of the inhibitors at
concentrations equal to 0.1, 1, 10 and 100 x IC50, and the results are shown in FIGS. 4A,
4B and 4C for PC3 cells.
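
A minimal Python sketch of how triplicate XTT readings might be reduced to a proliferation value relative to the DMSO control is given below. It assumes, as is common practice but not stated in the source, that the 650 nm reading serves as a reference wavelength subtracted from the 490 nm signal; the function names and absorbance values are hypothetical.

```python
from statistics import mean

# Minimal sketch: reduce triplicate XTT plate readings to percent of control.
# Assumption (not stated in the source): A650 is a reference reading that is
# subtracted from A490. All absorbance values are hypothetical.

def corrected_absorbance(a490, a650):
    """Signal at 490 nm minus the reference reading at 650 nm."""
    return a490 - a650

def percent_of_control(treated_wells, control_wells):
    """Mean corrected signal of treated wells as a percentage of control."""
    treated = mean(corrected_absorbance(a490, a650) for a490, a650 in treated_wells)
    control = mean(corrected_absorbance(a490, a650) for a490, a650 in control_wells)
    return treated / control * 100.0

# Hypothetical (A490, A650) pairs for triplicate wells.
control_wells = [(1.10, 0.10), (1.05, 0.05), (1.15, 0.15)]
treated_wells = [(1.60, 0.10), (1.55, 0.05), (1.65, 0.15)]

result = percent_of_control(treated_wells, control_wells)
print(f"{result:.1f}% of vehicle control")
```

A value above 100% corresponds to an increase in proliferation relative to the vehicle-only wells.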

Overall, the DPRPs are expressed in a wide variety of tissues, as has been
demonstrated by mRNA amplification, western blotting and immunohistochemistry.
DPRP-1 was most abundant in the testis by Northern blot and western blot. The large
number of expressed sequence tags (ESTs) from testis cDNA sources that are
homologous to DPRP-1 also confirms abundant expression of DPRP-1 in testis.
Example 4 describes the immunohistochemical localization of DPRP-1 protein in human
testis using a specific DPRP-1 antibody. DPRP-1 is strongly expressed in epithelioid
Leydig cells, and Leydig cells are the primary source of testicular androgens (male
steroid hormones) in the mammalian male. In the interstitium of the testis, Leydig cells
and macrophages are in close association, with "digitations" of Leydig cell processes
extending onto the macrophage surface. Multinucleated cells in close proximity to the
Leydig cells were also stained with DPRP-1 antibody, suggesting that the protease is
also expressed in macrophages; macrophages in the testis play an important role in
the paracrine regulation of Leydig cells. Cytokines secreted by the testicular
macrophages are mitogenic to Leydig cells and play an important role in the
differentiation of mesenchymal progenitor cells into mature Leydig cells. A clearer
understanding of the proteins and pathways involved in the maturation of the testis is
important for the discovery of new treatments for precocious puberty. In addition,
Leydig cells cause tumors such as sex cord-stromal tumors via sexual steroid production
(predominantly testosterone). Testosterone is associated with several neoplasias and
diseases such as breast carcinoma and uterine cancers, ovarian carcinoma and
androgenic alopecia (hair loss). Further examination of the localization of DPRP
proteins in other glands in the body (e.g. adrenal glands) that produce testosterone and
other androgenic hormones is currently under way. The possible association
of DPRP-1 with steroid and polypeptide hormone biosynthetic pathway functions is
being investigated, and Example 7 is relevant to understanding the role of DPRP proteins
in prostate, testis and breast in vitro cell models.

Immunohistochemical analysis also localized DPRP-1 to endometrial glands in
the uterus (see Example 4), pancreatic acini, glomeruli of the kidney, plasma cells in the
bladder, a subset of B-cells in the tonsils, columnar epithelial cells of the prostate and
poorly differentiated prostate squamous metaplasia, Gleason grade 4 prostatic
carcinoma, and hyperplastic glands in benign prostatic hyperplasia. Positive staining in
breast carcinoma, as well as in seminoma and prostate squamous metaplasia, suggests a
general association of DPRP-1 with hormone-sensitive tissues, particularly in cells that
become poorly differentiated. The presence of DPRP-1 in specialized epithelial cells
and in inflammatory plasma cells (lymphocytes) is also of interest. Inflammatory breast
carcinoma has an abundance of infiltrating lymphocytes and an overall bad prognosis.
DPRP-1 and other DPRP proteins appear in medullary carcinomas that typically have a
constant infiltrating lymphoplasmacytic component at the periphery of the tumor, which
is thought to represent a reaction of the host tissues to the neoplasm. Most of the
lymphocytes are T cells, and most of the plasma cells are of the IgG-producing type.
Several antigens are abundant on B cells, a subgroup of breast-cancer cells, and other
epithelial cancer cells, and these antigens are targets for a new class of therapeutic
monoclonal antibodies, with some notable success having been achieved with a
humanized monoclonal antibody against the B-cell-specific antigen CD20.
Accordingly, monoclonal antibodies to DPRP proteins are felt to be useful to diagnose
and treat diseases in which they are involved, including cancer.

The expression of DPRP-1 in specialized epithelial cells of a number of tissues
suggests that DPRP-1 and other DPRP proteins may be involved in growth and
differentiation thereof. Testing using inhibitors described in Example 6 in in vitro
models of prostate and testis cancer (Example 7) showed that DPRP-1/DPRP-2
inhibitors caused a 50-60% increase in proliferation of PC3 cells at nM concentrations, as
shown in FIGS. 4A-4C.

Although the invention has been described in accordance with its preferred
embodiments, which constitute the best mode presently known to the inventors, it should
be understood that changes and modifications as would be obvious to those skilled in
this art may be made without departing from its scope which is set forth in the claims
appended hereto. For example, although the disclosure focuses on DPRP-1 and DPRP-2
in certain instances, DPRP-3 and its fragments are considered to be similarly useful, as
are nucleic acids encoding same. Particular features of the invention are emphasized in
the claims that follow.

CLAIMS:

1. Isolated nucleic acid

which encodes (a) a polypeptide, which includes the amino acid sequence of
one of SEQ ID NOS:1, 3 and 5, or (b) a polypeptide having an amino acid sequence that is
at least about 70% similar thereto and exhibits the same biological function;

or which is an alternative splice variant of one of SEQ ID NOS:2, 4 and 6; or 

which is a probe comprising at least 14 contiguous nucleotides from said 
nucleic acid encoding (a) or (b) ; or 

which is complementary to any one of the foregoing. 

2. The isolated nucleic acid of claim 1 which is DNA or RNA. 

3. The isolated nucleic acid of claim 1 which is a DNA transcript that includes 
the entire length of any one of SEQ ID NOS:2, 4 and 6 or which is complementary to the 
entire coding region of one of SEQ ID NOS:2, 4 and 6. 

4. An antisense oligonucleotide directed against the DNA of claim 3. 

5. The isolated nucleic acid of claim 1 which is an RNA transcript which 
includes the entire length of any one of SEQ ID NOS:2, 4 and 6. 

6. The isolated nucleic acid of claim 1 which is an alternative splice variant of 
one of SEQ ID NOS:2, 4 and 6. 

7. A polypeptide encoded by the nucleic acid of claim 6. 

8. The isolated nucleic acid of claim 1 which encodes a polypeptide having an 
amino acid that is at least about 90% similar to one of SEQ ID NOS:1, 3 and 5.

9. The isolated nucleic acid of claim 1 which encodes a polypeptide having an 
amino acid that is at least about 95% similar to one of SEQ ID NOS: 1, 3 and 5. 

10. The isolated nucleic acid of claim 1 which encodes a polypeptide that has at 
least about 90% identity with one of SEQ ID NOS:1, 3 and 5.

11. A nucleic acid probe according to claim 1 comprising at least 14 contiguous
nucleotides from one of SEQ ID NOS:2, 4 and 6. 

12. An isolated recombinant polynucleotide molecule comprising nucleic acid
according to claim 1 plus expression-controlling elements linked operably with said nucleic
acid to drive expression thereof.

13. An expression vector comprising the nucleic acid of claim 1 encoding a 
polypeptide having the entire amino acid sequence set forth in any one of SEQ ID NOS:1, 3
and 5 operably linked to a promoter, said expression vector being present in a 
compatible host cell. 

14. A mammalian, insect or bacterial host cell that has been genetically
engineered by the insertion of nucleic acid according to claim 1 which codes for at least the 
mature protein portion of the amino acid sequence of SEQ ID NO: 1, 3 or 5. 

15. A process for producing a polypeptide which includes the mature protein 
portion of one of SEQ ID NOS:1, 3 and 5, which process comprises culturing the host cell
of claim 11 under conditions sufficient for the production of said polypeptide.

16. The process of claim 15 wherein said polypeptide is expressed at the 
surface of said cell and further includes the step of recovering the polypeptide or a fragment 
thereof from the culture. 

17. A polypeptide 

which may be optionally glycosylated, and 

which (a) has the amino acid sequence of a mature protein set forth in any one
of SEQ ID NOS:1, 3 and 5; (b) has the amino acid sequence of a mature protein having at
least about 70% similarity to one of the mature proteins of (a) and which exhibits the same
biological function; (c) has the amino acid sequence of a mature protein having at least
about 90% identity with a mature protein of any of SEQ ID NOS:1, 3 and 5; or (d) is an
immunologically reactive fragment of (a).

18. The polypeptide according to claim 14 which is a mature protein having at 
least about 95% similarity to a mature protein of (a). 

19. The polypeptide according to claim 14 which is a mature protein having at 
least about 95% similarity to a mature protein of (a). 

20. The polypeptide according to claim 14 having the amino acid sequence of the 
mature protein of one of SEQ ID NOS:1, 3 and 5, or is a fragment thereof which exhibits
the same biological function as the respective mature protein. 

21. A DPRP antagonist which inhibits the biological function of one of said 
mature proteins of claim 17, 18 and 19. 

22. An antibody that recognizes a polypeptide or a fragment according to 
claim 17. 

23. The antibody of claim 22 which recognizes a polypeptide having an amino 
acid sequence of SEQ ID NO: 1 or 3 or 5. 

24. A method for the screening for a compound capable of inhibiting the 
enzymatic activity of at least one mature protein of claim 17, which method comprises 
incubating said mature protein and a suitable substrate for said mature protein in the 
presence of one or more test compounds or salts thereof, measuring the enzymatic activity 
of said mature protein, comparing said activity with comparable activity determined in the 
absence of a test compound, and selecting the test compound or compounds that reduce the
enzymatic activity. 

25. A method for the screening for a compound capable of inhibiting the 
enzymatic activity of DPPIV that does not inhibit the enzymatic activity of at least one of 
the mature proteins of claim 20, which method comprises incubating said mature protein 
and a suitable substrate for said mature protein in the presence of one or more inhibitors of 
DPPIV or salts thereof, measuring the enzymatic activity of said mature protein, comparing 
said activity with comparable activity determined in the absence of the DPPIV inhibitor, 
and selecting a compound that does not reduce the enzymatic activity of said mature 
protein. 

FIG. 4A: x-axis, [inhibitor 88] as fold IC50 (0.0, 0.1, 1.0, 10.0, 100.0)

FIG. 4B: x-axis, [inhibitor 65] as fold IC50 (0.0, 0.1, 1.0, 10.0, 100.0)

FIG. 4C: x-axis, [inhibitor 66] as fold IC50 (0.0, 0.1, 1.0, 10.0, 100.0)

Sequence Listing Summary

SEQ ID NO.
1. DPRP1 a.a. sequence
2. DPRP1 DNA sequence
3. DPRP2 a.a. sequence
4. DPRP2 DNA sequence
5. DPRP-3 a.a. sequence
6. DPRP-3 DNA sequence
7. DPRP-1 transcript 0 a.a. sequence
8. DPRP-1 transcript 0 DNA sequence
9. DPRP-1 transcript 1 a.a. sequence
10. DPRP-1 transcript 1 DNA sequence
11. DPRP-1 transcript 2 a.a. sequence
12. DPRP-1 transcript 2 DNA sequence
13. DPRP-1 transcript 3 a.a. sequence
14. DPRP-1 transcript 3 DNA sequence
15. DPRP-1 transcript 4 a.a. sequence
16. DPRP-1 transcript 4 DNA sequence
17. DPRP-1 transcript 5 a.a. sequence
18. DPRP-1 transcript 5 DNA sequence
19. DPRP-1 transcript 6 a.a. sequence
20. DPRP-1 transcript 6 DNA sequence
21. DPRP-1 transcript 7 a.a. sequence
22. DPRP-1 transcript 7 DNA sequence
23. DPRP-2 transcript 0 a.a. sequence
24. DPRP-2 transcript 0 DNA sequence
25. DPRP-2 transcript 1 a.a. sequence
26. DPRP-2 transcript 1 DNA sequence
27. DPRP-2 transcript 2 a.a. sequence
28. DPRP-2 transcript 2 DNA sequence
29. DPRP-2 transcript 3 a.a. sequence
30. DPRP-2 transcript 3 DNA sequence
31. DPRP-2 transcript 4 a.a. sequence
32. DPRP-2 transcript 4 DNA sequence
33. DPRP-2 transcript 5 a.a. sequence
34. DPRP-2 transcript 5 DNA sequence
35. DPRP-2 transcript 6 a.a. sequence
36. DPRP-2 transcript 6 DNA sequence
37. DPRP-2 transcript 7 a.a. sequence
38. DPRP-2 transcript 7 DNA sequence
39. DPRP-2 transcript 8 a.a. sequence
40. DPRP-2 transcript 8 DNA sequence
41. DPRP-3 transcript 0 a.a. sequence
42. DPRP-3 transcript 0 DNA sequence
43. DPRP-3 transcript 1 a.a. sequence
44. DPRP-3 transcript 1 DNA sequence
45. DPRP1 forward primer used for cloning
46. DPRP1 reverse primer used for cloning full length gene
47. DPRP1 reverse primer used for cloning fusion gene
48. DPRP1 forward primer used for expression profiling
49. DPRP1 reverse primer used for expression profiling
50. DPRP2 forward primer used for cloning
51. DPRP2 reverse primer used for cloning full length gene
52. DPRP2 reverse primer used for cloning fusion gene
53. DPRP2 forward primer used for expression profiling
54. DPRP2 reverse primer used for expression profiling
55. DPRP3 forward primer used for cloning
56. DPRP3 reverse primer used for cloning full length gene
57. DPRP3 forward primer used for cloning fusion gene
58. DPRP3 reverse primer used for cloning fusion gene
59. DPRP1 peptide antigen sequences
60. DPRP2 peptide antigen sequences
61. DPRP3 peptide antigen sequences

SEQUENCE LISTING



<110> Qi, Steve 

Akinsanya, Karen 
Riviere, Pierre 
Junien, Jean-Louis 

<120> NOVEL SERINE PROTEASE GENES RELATED TO DPPIV 

<130> 70669 

<150> US 60/240,117 

<151> 2000-10-12 

<160> 61 

<170> PatentIn version 3.1



<210> 1 

<211> 882 

<212> PRT 

<213> Homo sapiens 

<400> 1 



Met 


Ala 


Ala 


Ala 


Met 


GlU 


Thr Glu Gin Leu 


Gly Val Glu He Phe 


Glu 


1 








5 




10 


15 




Thr 


Ala 


Asp 


Cys 


Glu 


Glu 


Asn He Glu Ser 


Gin Asp Arg Pro Lys 


Leu 








20 






25 


30- 




Glu 


Pro 


Phe 


Tyr 


Val 


Glu 


Arg Tyr Ser Trp 


Ser Gin Leu Lys Lys 


Leu 






35 








40 


45 ' 




Leu 


Ala 


Asp 


Thr 


Arg 


Lys 


Tyr His Gly Tyr 


Met Met Ala Lys Ala 


Pro 




50 










55 


60 




His 


Asp 


Phe 


Met 


Phe 


Val 


Lys Arg Asn Asp 


Pro Asp Gly Pro His 


Ser 


65 










70 




75 


80 


Asp 


Arg 


He 


Tyr 


Tyr 


Leu 


Ala Met Ser Gly 


Glu Asn Arg Glu Asn 


Thr 










85 




90 


95 




Leu 


Phe 


Tyr 


Ser 


Glu 


He 


Pro Lys Thr He 


Asn Arg Ala Ala Val 


Leu 








100 






105 


110 




Met 


Leu 


Ser 


Trp 


Lys 


Pro 


Leu Leu Asp Leu 


Phe Gin Ala Thr Leu 


Asp 






115 








120 


125 




Tyr 


Gly 


Met 


Tyr 


Ser 


Arg 


Glu Glu Glu Leu 


Leu Arg Glu Arg Lys 


Arg 




130 










135 


140 




lie 


Gly 


Thr 


Val 


Gly 


He 


Ala Ser Tyr Asp 


Tyr His Gin Gly Ser 


Gly 


145 








150 




155 


160 


Thr 


Phe 


Leu 


Phe 


Gin 


Ala 


Gly Ser Gly He 


Tyr His val Lys Asp 


Gly 










165 




170 


175 




Gly 


Pro 


Gin 


Gly 


Phe 


Thr 


Gin Gin Pro Leu 


Arg Pro Asn Leu Val 


Glu 






180 






185 


190 




Thr 


Ser 


Cys 


Pro 


Asn 


He 


Arg Met Asp Pro 


Lys Leu Cys Pro Ala 


Asp 






195 








200 


205 




Pro 


Asp 


Trp 


lie 


Ala 


Phe 


He His Ser Asn 


Asp He Trp He Ser 


Asn 




210 










215 


220 




lie 


Val 


Thr 


Arg 


Glu 


Glu 


Arg Arg Leu Thr 


Tyr Val His Asn Glu 


Leu 


225 










230 




235 


240 


Ala 


Asn 


Met 


Glu 


Glu 


Asp 


Ala Arg Ser Ala 


Gly Val Ala Thr Phe 


Val 










245 




250 


255 
Leu


Gin 


GlU 


GlU 








260 


Ala 


Glu 


Thr 


Thr 






275 




Glu 


Asn 


Asp 


Glu 




290 






Leu 


Glu 


Thr 


Arg 


305 








Ala 


Asn 


Pro 


Lys 


Glu 


Gly 


Arg 


He 








340 


Glu 


He 


Leu 


Phe 






355 




Pro 


Glu 


Gly 


Lys 




370 






Arg 


Leu 


Gin 


He 


385 








Asp 


Asp 


Val 


Met 


Val 


Thr 


Pro 


Leu 








420 


lie 


His 


Asp 


He 






435 




Glu 


Phe 


He 


Phe 




450 






Lys 


lie 


Thr 


Ser 


465 








Gly 


Leu 


Pro 


Ala 


Ala 


He 


Thr 


Ser 








500 


lie 


Gin 


Val 


Asp 






515 




Asp 


Ser 


Pro 


Leu 




530 






Gly 


Glu 


Val 


Thr 


545 








lie 


Ser 


Gin 


His 


Asn 


Pro 


His 


Cys 








580 


Pro 


Thr 


Cys 


Lys 






595 




Gly 


Pro 


Leu 


Pro 




610 






Thr 


Thr 


Gly 


Phe 


625 








Gin 


Pro 


Gly 


Lys 




vaj. 




Leu 








660 


Leu 


Asn 


Thr 


Leu 






675 




Arg 


Gly 


Ser 


Cys 




690 






Lys 


Met 


Gly 


Gin 



705 



Phe 


Asp 


Arg 


Tyr 


Pro 


Ser 


Gly 


Gly 








280 


Ser 


Glu 


val 


Glu 






295 




Arg 


Ala 


Asp 


Ser 




310 






Val 


Thr 


Phe 


Lys 


325 








He 


Asp 


Val 


He 


Glu 


Gly 


Val 


Glu 








360 


Tyr 


Ala 


Trp 


Ser 






375 




Val 


Leu 


He 


Ser 




390 






Glu 


Arg 


Gin 


Arg 


405 








He 


He 


Tyr 


Glu 


Phe 


His 


Val 


Phe 








440 


Ala 


Ser 


Glu 


Cys 






455 




He 


Leu 


Lys 


Glu 




470 






Pro 


Ser 


Asp 


Phe 


485 








Gly 


Glu 


Trp 


Glu 


Glu 


Val 


Arg 


Arg 








520 


Glu 


His 


His 


Leu 






535 




Arg 


Leu 


Thr 


Asp 




550 






Cys 


Asp 


Phe 


Phe 


565 








Val 


Ser 


Leu 


Tyr 


Thr 


Lys 


Glu 


Phe 








600 


Asp 


Tyr 


Thr 


Pro 






615 




Thr 


Leu 


Tyr 


Gly 




630 






Lys 


Tyr 


Pro 


Thr 


645 








Val 


Asn 


Asn 


Arg 


Ala 


Ser 


Leu 


Gly 








680 


His 


Arg 


Gly 


Leu 






695 




He 


Glu 


He 


Asp 




710 







Ser 


Gly Tyr 


Trp 


265 






Lys 


He Leu 


Arg 


lie 


xie his 


vai 






300 


Phe 


Arg Tyr 


Pro 




315 




Met 


Ser Glu 


lie 




330 




Asp 


Lys Glu 


Leu 


345 






Tyr 


He Ala 


Arg 


He 


Leu Leu 


Asp 






380 


Pro 


Glu Leu 


Phe 




395 




Leu 


He Glu 


Ser 




410 




Glu 


Thr Thr 


Asp 


425 






Pro 


Gin Ser 


mm A _ 

His 


Lys 


Thr Gly 


Tit. — . 

Phe 






460 


Ser 


Lys Tyr 


Lys 




475 




Lys 


Cys Pro 


He 




490 




Val 


Leu Gly 


Arg 


505 






Leu 


Val Tyr 


Phe 


Tyr 


Val Val 


Ser 






540 


Arg 


Gly Tyr 


Ser 




555 




He 


Ser Lys 


Tyr 




570 




Lys 


Leu Ser 


Ser 


585 






Trp 


Ala Thr 


He 


Pro 


Glu He 


Phe 






620 


Met 


Leu Tyr 


Lys 




635 




Val 


Leu Phe 


He 




650 




Phe 


Lys Gly 


Val 


665 






Tyr 


Val Val 


Val 


Lys 


Phe Glu 


Gly 






700 


Asp 


Gin Val 


Glu 




715 





Trp 


Cys 


Pro 


Lys 




270 






He 


Leu 


Tyr 


Glu 


285 








Thr 


Ser 


Pro 


Met 


Lys 


Thr 


Gly 


Thr 








320 


Met 


He 


Asp 


Ala 






335 




lie 


Gin 


Pro 


Phe 




350 






Ala 


Gly 


Trp 


Thr 


365 








Arg 


Ser 


Gin 


Thr 


He 


Pro 


Val 


Glu 








400 


Val 


Pro 


Asp 


Ser 






415 




He 


Trp 


He 


Asn 




430 






Glu 


Glu 


Glu 


He 


445 








Arg 


His 


Leu 


Tyr 


Arg 


Ser 


Ser 


Gly 








480 


Lys 


Glu 


Glu 


He 






495 




His 


Gly 


Ser 


Asn 




510 






Glu 


Gly 


Thr 


Lys 


525 








Tyr 


Val 


Asn 


Pro 


His 


Ser 


Cys 


Cys 








560 


Ser 


Asn 


Gin 


Lys 






575 




Pro 


Glu 


Asp 


Asp 




590 






Leu 


ASp 


Ser 


Ala 


605 








Ser 


Phe 


Glu 


Ser 


Pro 


HIS 


Asp 


Leu 








640 


Tyr 


Gly 


Gly 


Pro 






655 




Lys 


Tyr 


File 


Arg 




670 






Val 


He 


Asp 


Asn 


685 








Ala 


Phe 


Lys 


Tyr 


Gly 


Leu 


Gin 


Tyr 



720 
Leu


Ala 


Ser 


Arg Tyr Asp 


Phe 


He 


Asp 


Leu 


Asp 


Arg 


Val 


Gly He His 








725 








730 








735 


Gly 


Trp 


Ser 


Tyr Gly Gly 


Tyr 


Leu 


Ser 


Leu 


Met 


Ala 


Leu 


Met Gin Arg 








740 






745 










750 


Ser 


Asp 


lie 


Phe Arg Val 


Ala 


He 


Ala 


Gly 


Ala 


Pro 


Val 


Thr Leu Trp 






755 






760 










765 




lie 


Phe 


Tyr 


Asp Thr Gly 


Tyr 


Thr 


Glu 


Arg 


Tyr 


Met 


Gly 


His Pro Asp 




770 






775 










780 






Gin 


Asn 


Glu 


Gin Gly Tyr 


Tyr 


Leu 


Gly 


Ser 


Val 


Ala 


Met 


Gin Ala Glu 


785 






790 










795 






800 


Lys 


Phe 


Pro 


Ser Glu Pro 


Asn 


Arg 


Leu 


Leu 


Leu 


Leu 


HIS 


Gly Phe Leu 








805 








810 








815 


Asp 


Glu 


Asn 


Val His Phe 


Ala 


TT J 

His 


Thr 


Ser 


He 


Leu 


Leu 


Ser Phe Leu 








820 






825 










830 


Val 


Arg 


Ala 


Gly Lys Pro 


Tyr 


Asp 


Leu 


Gin 


He 


Tyr 


Pro 


Gin Glu Arg 






835 






840 










845 




His 


Ser 


He 


Arg Val Pro 


Glu 


Ser 


Gly 


Glu 


His 


Tyr 


Glu 


Leu His Leu 




850 






855 










860 






Leu 


His 


Tyr 


Leu Gin Glu 


Asn 


Leu 


Gly 


Ser 


Arg 


He 


Ala 


Ala Leu Lys 


865 






870 










875 






880 



Val He 

<210> 2 

<211> 2671 

<212> DNA 

<213> Homo sapiens 

<400> 2 

cggtaccatg gcagcagcaa tggaaacaga acagctgggt gttgagatat ttgaaactgc 60 

ggactgtgag gagaatattg aatcacagga tcggcctaaa ttggagcctt tttatgttga 120 

gcggtattcc tggagtcagc ttaaaaagct gcttgccgat accagaaaat atcatggcta 180 

catgatggct aaggcaccac atgatttcat gtttgtgaag aggaatgatc cagatggacc 240 

tcattcagac agaatctatt accttgccat gtctggtgag aacagagaaa atacactgtt 300 

ttattctgaa attcccaaaa ctatcaatag agcagcagtc ttaatgctct cttggaagcc 360 

tcttttggat ctttttcagg caacactgga ctatggaatg tattctcgag aagaagaact 420 

attaagagaa agaaaacgca ttggaacagt cggaattgct tcttacgatt atcaccaagg 480

aagtggaaca tttctgtttc aagccggtag tggaatttat cacgtaaaag atggagggcc 540 

acaaggattt acgcaacaac ctttaaggcc caatctagtg gaaactagtt gtcccaacat 600 

acggatggat ccaaaattat gccctgctga tccagactgg attgctttta tacatagcaa 660 

cgatatttgg atatctaaca tcgtaaccag agaagaaagg agactcactt atgtgcacaa 720 

tgagctagcc aacatggaag aagatgccag atcagctgga gtcgctacct ttgttctcca 780 

agaagaattt gatagatatt ctggctattg gtggtgtcca aaagctgaaa caactcccag 840 

tggtggtaaa attcttagaa ttctatatga agaaaatgat gaatctgagg tggaaattat 900 

tcatgttaca tcccctatgt tggaaacaag gagggcagat tcattccgtt atcctaaaac 960 

aggtacagca aatcctaaag tcacttttaa gatgtcagaa ataatgattg atgctgaagg 1020 

aaggatcata gatgtcatag ataaggaact aattcaacct tttgagattc tatttgaagg 1080 

agttgaatat attgccagag ctggatggac tcctgaggga aaatatgctt ggtccatcct 1140 

actagatcgc tcccagactc gcctgcagat agtgttgatc tcacctgaat tatttatccc 1200 

agtagaagat gatgttatgg aaaggcagag actcattgag tcagtgcctg attctgtgac 1260 

gccactaatt atctatgaag aaacaacaga catctggata aatatccatg acatctttca 1320 

tgtttttccc caaagtcacg aagaggaaat tgagtttatt tttgcctctg aatgcaaaac 1380 

aggtttccgt catttataca aaattacatc tattttaaag gaaagcaaat ataaacgatc 1440 

cagtggtggg ctgcctgctc caagtgattt caagtgtcct atcaaagagg agatagcaat 1500 

taccagtggt gaatgggaag ttcttggccg gcatggatct aatatccaag ttgatgaagt 1560 

cagaaggctg gtatattttg aaggcaccaa agactcccct ttagagcatc acctgtacgt 1620 

agtcagttac gtaaatcctg gagaggtgac aaggctgact gaccgtggct actcacattc 1680 

ttgctgcatc agtcagcact gtgacttctt tataagtaag tatagtaacc agaagaatcc 1740 

acactgtgtg tccctttaca agctatcaag tcctgaagat gacccaactt gcaaaacaaa 1800 

ggaattttgg gccaccattt tggattcagc aggtcctctt cctgactata ctcctccaga 1860 

aattttctct tttgaaagta ctactggatt tacattgtat gggatgctct acaagcctca 1920 

tgatctacag cctggaaaga aatatcctac tgtgctgttc atatatggtg gtcctcaggt 1980

gcagttggtg aataatcgat ttaaaggagt caagtatttc cgcttgaata ccctagcctc 2040 

tctaggttat gtggttgtag tgatagacaa caggggatcc tgtcaccgag ggcttaaatt 2100 

tgaaggcgcc tttaaatata aaatgggtca aatagaaatt gacgatcagg tggaaggact 2160 

ccaatatcta gcttctcgat atgatttcat tgacttagat cgtgtgggca tccacggctg 2220 

gtcctatgga ggatacctct ccctgatggc attaatgcag aggtcagata tcttcagggt 2280 

tgctattgct ggggccccag tcactctgtg gatcttctat gatacaggat acacggaacg 2340 

ttatatgggt caccctgacc agaatgaaca gggctattac ttaggatctg tggccatgca 2400 

agcagaaaag ttcccctctg aaccaaatcg tttactgctc ttacatggtt tcctggatga 2460 

gaatgtccat tttgcacata ccagtatatt actgagtttt ttagtgaggg ctggaaagcc 2520 

atatgattta cagatctatc ctcaggagag acacagcata agagttcctg aatcgggaga 2580 

acattatgaa ctgcatcttt tgcactacct tcaagaaaac cttggatcac gtattgctgc 2640 

tctaaaagtg atatgagcgg ccgcgagctc c 2671 

<210> 3 

<211> 863 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Met Ala Thr Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala

1 5 10 15

Thr Asp Asp Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp

20 25 30

Gly Leu Arg Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile

35 40 45

Val Asn Lys Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu

50 55 60

Ser Gly Pro His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly
65 70 75 80

Ser Arg Glu Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg

85 90 95

Lys Glu Ala Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe

100 105 110

Gln Ala Thr Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu

115 120 125

Arg Glu Arg Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe

130 135 140

His Ser Glu Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe
145 150 155 160

His Cys Arg Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys

165 170 175

Pro Leu Glu Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys

180 185 190

Ile Cys Pro Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp

195 200 205

Leu Trp Val Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe

210 215 220

Cys His Gln Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly
225 230 235 240

Val Ala Thr Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr

245 250 255

Trp Trp Cys Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr

260 265 270

Leu Arg Ile Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile

275 280 285

His Val Pro Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg

290 295 300

Tyr Pro Arg Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala

305 310 315 320

Glu Phe Gln Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys

325 330 335

Glu Leu Val Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile

340 345 350

Ala Arg Ala Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe

355 360 365

Leu Asp Arg Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala

370 375 380

Leu Phe Ile Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala

385 390 395 400

Arg Ala Val Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val

405 410 415

Thr Asn Val Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln

420 425 430

Ser Glu Gly Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys

435 440 445

Thr Gly Phe Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln

450 455 460

Gly Tyr Asp Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys

465 470 475 480

Cys Pro Ile Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val

485 490 495

Leu Ala Arg His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu

500 505 510

Val Tyr Phe Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr

515 520 525

Val Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro

530 535 540

Gly Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val

545 550 555 560

Ser His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys

565 570 575

Leu Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp

580 585 590

Ala Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro

595 600 605

Glu Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met

610 615 620

Ile Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val

625 630 635 640

Leu Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe

645 650 655

Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr

660 665 670

Ala Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg

675 680 685

Phe Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp

690 695 700

Gln Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp

705 710 715 720

Leu Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser

725 730 735

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Val Ala Ile Ala

740 745 750

Gly Ala Pro Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu

755 760 765

Arg Tyr Met Asp Val Pro Glu Asn Asn Gln His Gly Tyr Glu Ala Gly

770 775 780

Ser Val Ala Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu

785 790 795 800

Leu Ile Leu His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr

805 810 815

Asn Phe Leu Val Ser Gln Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu

820 825 830

Gln Ile Tyr Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu Ser Gly

835 840 845

Glu His Tyr Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr Leu

850 855 860







<210> 4 

<211> 2617 

<212> DNA 

<213> Homo sapiens 

<400> 4 

caagcttacc atggccacca ccgggacccc aacggccgac cgaggcgacg cagccgccac 60 

agatgacccg gccgcccgct tccaggtgca gaagcactcg tgggacgggc tccggagcat 120 

catccacggc agccgcaagt actcgggcct cattgtcaac aaggcgcccc acgacttcca 180 

gtttgtgcag aagacggatg agtctgggcc ccactcccac cgcctctact acctgggaat 240 

gccatatggc agccgagaga actccctcct ctactctgag attcccaaga aggtccggaa 300 

agaggctctg ctgctcctgt cctggaagca gatgctggat catttccagg ccacgcccca 360 

ccatggggtc tactctcggg aggaggagct gctgagggag cggaaacgcc tgggggtctt 420 

cggcatcacc tcctacgact tccacagcga gagtggcctc ttcctcttcc aggccagcaa 480 

cagcctcttc cactgtcgcg acggcggcaa gaacggcttc atggtgtccc ctatgaaacc 540 

gctggaaatc aagacccagt gctcagggcc ccggatggac cccaaaatct gccctgccga 600 

ccctgccttc ttctccttca tcaataacag cgacctgtgg gtggccaaca tcgagacagg 660 

cgaggagcgg cggctgacct tctgccacca aggtttatcc aatgtcctgg atgaccccaa 720 

gtctgcgggt gtggccacct tcgtcataca ggaagagttc gaccgcttca ctgggtactg 780 

gtggtgcccc acagcctcct gggaaggttc agagggcctc aagacgctgc gaatcctgta 840 

tgaggaagtc gatgagtccg aggtggaggt cattcacgtc ccctctcctg cgctagaaga 900 

aaggaagacg gactcgtatc ggtaccccag gacaggcagc aagaatccca agattgcctt 960 

gaaactggct gagttccaga ctgacagcca gggcaagatc gtctcgaccc aggagaagga 1020 

gctggtgcag cccttcagct cgctgttccc gaaggtggag tacatcgcca gggccgggtg 1080 

gacccgggat ggcaaatacg cctgggccat gttcctggac cggccccagc agtggctcca 1140 

gctcgtcctc ctccccccgg ccctgttcat cccgagcaca gagaatgagg agcagcggct 1200 

agcctctgcc agagctgtcc ccaggaatgt ccagccgtat gtggtgtacg aggaggtcac 1260 

caacgtctgg atcaatgttc atgacatctt ctatcccttc ccccaatcag agggagagga 1320 

cgagctctgc tttctccgcg ccaatgaatg caagaccggc ttctgccatt tgtacaaagt 1380 

caccgccgtt ttaaaatccc agggctacga ttggagtgag cccttcagcc ccggggaaga 1440 

tgaatttaag tgccccatta aggaagagat tgctctgacc agcggtgaat gggaggtttt 1500 

ggcgaggcac ggctccaaga tctgggtcaa tgaggagacc aagctggtgt acttccaggg 1560 

caccaaggac acgccgctgg agcaccacct ctacgtggtc agctatgagg cggccggcga 1620 

gatcgtacgc ctcaccacgc ccggcttctc ccatagctgc tccatgagcc agaacttcga 1680 

catgttcgtc agccactaca gcagcgtgag cacgccgccc tgcgtgcacg tctacaagct 1740 

gagcggcccc gacgacgacc ccctgcacaa gcagccccgc ttctgggcta gcatgatgga 1800 

ggcagccagc tgccccccgg attatgttcc tccagagatc ttccatttcc acacgcgctc 1860 

ggatgtgcgg ctctacggca tgatctacaa gccccacgcc ttgcagccag ggaagaagca 1920 

ccccaccgtc ctctttgtat atggaggccc ccaggtgcag ctggtgaata actccttcaa 1980 

aggcatcaag tacttgcggc tcaacacact ggcctccctg ggctacgccg tggttgtgat 2040 

tgacggcagg ggctcctgtc agcgagggct tcggttcgaa ggggccctga aaaaccaaat 2100 

gggccaggtg gagatcgagg accaggtgga gggcctgcag ttcgtggccg agaagtatgg 2160 

cttcatcgac ctgagccgag ttgccatcca tggctggtcc tacgggggct tcctctcgct 2220 

catggggcta atccacaagc cccaggtgtt caaggtggcc atcgcgggtg ccccggtcac 2280 

cgtctggatg gcctacgaca cagggtacac tgagcgctac atggacgtcc ctgagaacaa 2340 

ccagcacggc tatgaggcgg gttccgtggc cctgcacgtg gagaagctgc ccaatgagcc 2400 

caaccgcttg cttatcctcc acggcttcct ggacgaaaac gtgcactttt tccacacaaa 2460
cttcctcgtc tcccaactga tccgagcagg gaaaccttac cagctccaga tctaccccaa 2520
cgagagacac agtattcgct gccccgagtc gggcgagcac tatgaagtca cgttgctgca 2580 
ctttctacag gaatacctct gagcggccgc ggatccg 2617 

<210> 5 

<211> 796 

<212> PRT 

<213> Homo sapiens 

<400> 5 



Met Asn Gln Thr Ala Ser Val Ser His His Ile Lys Cys Gln Pro Ser

1 5 10 15

Lys Thr Ile Lys Glu Leu Gly Ser Asn Ser Pro Pro Gln Arg Asn Trp

20 25 30

Lys Gly Ile Ala Ile Ala Leu Leu Val Ile Leu Val Val Cys Ser Leu

35 40 45

Ile Thr Met Ser Val Ile Leu Leu Ser Pro Asp Glu Leu Thr Asn Ser

50 55 60

Ser Glu Thr Arg Leu Ser Leu Glu Asp Leu Phe Arg Lys Asp Phe Val

65 70 75 80

Leu His Asp Pro Glu Ala Arg Trp Ile Asn Asp Thr Asp Val Val Tyr

85 90 95

Lys Ser Glu Asn Gly His Val Ile Lys Leu Asn Ile Glu Thr Asn Ala

100 105 110

Thr Thr Leu Leu Leu Glu Asn Thr Thr Phe Val Thr Phe Lys Ala Ser

115 120 125

Arg His Ser Val Ser Pro Asp Leu Lys Tyr Val Leu Leu Ala Tyr Asp

130 135 140

Val Lys Gln Ile Phe His Tyr Ser Tyr Thr Ala Ser Tyr Val Ile Tyr

145 150 155 160

Asn Ile His Thr Arg Glu Val Trp Glu Leu Asn Pro Pro Glu Val Glu

165 170 175

Asp Ser Val Leu Gln Tyr Ala Ala Trp Gly Val Gln Gly Gln Gln Leu

180 185 190

Ile Tyr Ile Phe Glu Asn Asn Ile Tyr Tyr Gln Pro Asp Ile Lys Ser

195 200 205

Ser Ser Leu Arg Leu Thr Ser Ser Gly Lys Glu Glu Ile Ile Phe Asn

210 215 220

Gly Ile Ala Asp Trp Leu Tyr Glu Glu Glu Leu Leu His Ser His Ile

225 230 235 240

Ala His Trp Trp Ser Pro Asp Gly Glu Arg Leu Ala Phe Leu Met Ile

245 250 255

Asn Asp Ser Leu Val Pro Thr Met Val Ile Pro Arg Phe Thr Gly Ala

260 265 270

Leu Tyr Pro Lys Gly Lys Gln Tyr Pro Tyr Pro Lys Ala Gly Gln Val

275 280 285

Asn Pro Thr Ile Lys Leu Tyr Val Val Asn Leu Tyr Gly Pro Thr His

290 295 300

Thr Leu Glu Leu Met Pro Pro Asp Ser Phe Lys Ser Arg Glu Tyr Tyr

305 310 315 320

Ile Thr Met Val Lys Trp Val Ser Asn Thr Lys Thr Val Val Arg Trp

325 330 335

Leu Asn Arg Pro Gln Asn Ile Ser Ile Leu Thr Val Cys Glu Thr Thr

340 345 350

Thr Gly Ala Cys Ser Lys Lys Tyr Glu Met Thr Ser Asp Thr Trp Leu

355 360 365

Ser Gln Gln Asn Glu Glu Pro Val Phe Ser Arg Asp Gly Ser Lys Phe

370 375 380

Phe Met Thr Val Pro Val Lys Gln Gly Gly Arg Gly Glu Phe His His

385 390 395 400

Ile Ala Met Phe Leu Ile Gln Ser Lys Ser Glu Gln Ile Thr Val Arg

405 410 415

His Leu Thr Ser Gly Asn Trp Glu Val Ile Lys Ile Leu Ala Tyr Asp

420 425 430

Glu Thr Thr Gln Lys Ile Tyr Phe Leu Ser Thr Glu Ser Ser Pro Arg

435 440 445

Gly Arg Gln Leu Tyr Ser Ala Ser Thr Glu Gly Leu Leu Asn Arg Gln

450 455 460

Cys Ile Ser Cys Asn Phe Met Lys Glu Gln Cys Thr Tyr Phe Asp Ala
465 470 475 480

Ser Phe Ser Pro Met Asn Gln His Phe Leu Leu Phe Cys Glu Gly Pro

485 490 495

Arg Val Pro Val Val Ser Leu His Ser Thr Asp Asn Pro Ala Lys Tyr

500 505 510

Phe Ile Leu Glu Ser Asn Ser Met Leu Lys Glu Ala Ile Leu Lys Lys

515 520 525

Lys Ile Gly Lys Pro Glu Ile Lys Ile Leu His Ile Asp Asp Tyr Glu

530 535 540

Leu Pro Leu Gln Leu Ser Leu Pro Lys Asp Phe Met Asp Arg Asn Gln
545 550 555 560

Tyr Ala Leu Leu Leu Ile Met Asp Glu Glu Pro Gly Gly Gln Leu Val

565 570 575

Thr Asp Lys Phe His Ile Asp Trp Asp Ser Val Leu Ile Asp Met Asp

580 585 590

Asn Val Ile Val Ala Arg Phe Asp Gly Arg Gly Ser Gly Phe Gln Gly

595 600 605

Leu Lys Ile Leu Gln Glu Ile His Arg Arg Leu Gly Ser Val Glu Val

610 615 620

Lys Asp Gln Ile Thr Ala Val Lys Phe Leu Leu Lys Leu Pro Tyr Ile
625 630 635 640

Asp Ser Lys Arg Leu Ser Ile Phe Gly Lys Gly Tyr Gly Gly Tyr Ile

645 650 655

Ala Ser Met Ile Leu Lys Ser Asp Glu Lys Leu Phe Lys Cys Gly Ser

660 665 670

Val Val Ala Pro Ile Thr Asp Leu Lys Leu Tyr Ala Ser Ala Phe Ser

675 680 685

Glu Arg Tyr Leu Gly Met Pro Ser Lys Glu Glu Ser Thr Tyr Gln Ala

690 695 700

Ala Ser Val Leu His Asn Val His Gly Leu Lys Glu Glu Asn Ile Leu
705 710 715 720

Ile Ile His Gly Thr Ala Asp Thr Lys Val His Phe Gln His Ser Ala

725 730 735

Glu Leu Ile Lys His Leu Ile Lys Ala Gly Val Asn Tyr Thr Met Gln

740 745 750

Val Tyr Pro Asp Glu Gly His Asn Val Ser Glu Lys Ser Lys Tyr His

755 760 765

Leu Tyr Ser Thr Ile Leu Lys Phe Phe Ser Asp Cys Leu Lys Glu Glu

770 775 780

Ile Ser Val Leu Pro Gln Glu Pro Glu Glu Asp Glu
785 790 795

<210> 6 

<211> 2583 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcctgggatt gtgcactgtc cagggtcctg aaacatgaac caaactgcca gcgtgtccca 60 
tcacatcaag tgtcaaccct caaaaacaat caaggaactg ggaagtaaca gccctccaca 120 
gagaaactgg aagggaattg ctattgctct gctggtgatt ttagttgtat gctcactcat 180
cactatgtca gtcatcctct taagcccaga tgaactcaca aattcgtcag aaaccagatt 240

gtctttggaa gacctcttta ggaaagactt tgtgcttcac gatccagagg ctcggtggat 300 

caatgataca gatgtggtgt ataaaagcga gaatggacat gtcattaaac tgaatataga 360 

aacaaatgct acdacattat tattggaaaa cacaactttt gtaaccttca aagcatcaag 420 

acattcagtt tcaccagatt taaaatatgt ccttctggca tatgatgtca aacagatttt 480 

tcattattcg tatactgctt catatgtgat ttacaacata cacactaggg aagtttggga 540 

gttaaatcct ccagaagtag aggactccgt cttgcagtac gcggcctggg gtgtccaagg 600 

gcagcagctg atttatattt ttgaaaataa tatctactat caacctgata taaagagcag 660 

ttcattgcga ctgacatctt ctggaaaaga agaaataatt tttaatggga ttgctgactg 720 

gttatatgaa gaggaactcc tgcattctca catcgcccac tggtggtcac cagatggaga 780 

aagacttgcc ttcctgatga taaatgactc tttggtaccc accatggtta tccctcggtt 840 

tactggagcg ttgtatccca aaggaaagca gtatccgtat cctaaggcag gtcaagtgaa 900 

cccaacaata aaattatatg ttgtaaacct gtatggacca actcacactt tggagctcat 960 

gccacctgac agctttaaat caagagaata ctatatcact atggttaaat gggtaagcaa 1020 

taccaagact gtggtaagat ggttaaaccg acctcagaac atctccatcc tcacagtctg 1080 

tgagaccact acaggtgctt gtagtaaaaa atatgagatg acatcagata cgtggctctc 1140 

tcagcagaat gaggagcccg tgttttctag agacggcagc aaattcttta tgacagtgcc 1200 

tgttaagcaa gggggacgtg gagaatttca ccacatagct atgttcctca tccagagtaa 1260 

aagtgagcaa attaccgtgc ggcatctgac atcaggaaac tgggaagtga taaagatctt 1320 

ggcatacgat gaaactactc aaaaaattta ctttctgagc actgaatctt ctcccagagg 1380 

aaggcagctg tacagtgctt ctactgaagg attattgaat cgccaatgca tttcatgtaa 1440 

tttcatgaaa gaacaatgta catattttga tgccagtttt agtcccatga atcaacattt 1500 

cttattattc tgtgaaggtc caagggtccc agtggtcagc ctacatagta cggacaaccc 1560 

agcaaaatat tttatattgg aaagcaattc tatgctgaag gaagctatcc tgaagaagaa 1620 

gataggaaag ccagaaatta aaatccttca tattgacgac tatgaacttc ctttacagtt 1680 

gtcccttccc aaagatttta tggaccgaaa ccagtatgct cttctgttaa taatggatga 1740 

agaaccagga ggccagctgg ttacagataa gttccatatt gactgggatt ccgtactcat 1800 

tgacatggat aatgtcattg tagcaagatt tgatggcaga ggaagtggat tccagggtct 1860 

gaaaattttg caggagattc atcgaagatt aggttcagta gaagtaaagg accaaataac 1920 

agctgtgaaa tttttgctga aactgcctta cattgactcc aaaagattaa gcatttttgg 1980 

aaagggttat ggtggctata ttgcatcaat gatcttaaaa tcagatgaaa agctttttaa 2040 

atgtggatcc gtggttgcac ctatcacaga cttgaaattg tatgcctcag ctttctctga 2100 

aagatacctt gggatgccat ctaaggaaga aagcacttac caggcagcca gtgtgctaca 2160 

taatgttcat ggcttgaaag aagaaaatat attaataatt catggaactg ctgacacaaa 2220 

agttcatttc caacactcag cagaattaat caagcaccta ataaaagctg gagtgaatta 2280 

tactatgcag gtctacccag atgaaggtca taacgtatct gagaagagca agtatcatct 2340 

ctacagcaca atcctcaaat tcttcagtga ttgtttgaag gaagaaatat ctgtgctacc 2400 

acaggaacca gaagaagatg aataatggac cgtatttata cagaactgaa gggaatattg 2460 

aggctcaatg aaacctgaca aagagactgt aatattgtag ttgctccaga atgtcaaggg 2520 

cagcttacgg agatgtcact ggagcagcac gctcagagac agtgaactag catttgaata 2580 

cac 2583 

<210> 7 

<211> 690 

<212> PRT 

<213> Homo sapiens 

<400> 7 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr

85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu

100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp

115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg

130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly

145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly

165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu

180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp

195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn

210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu

225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val

245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys

260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu

275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met

290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr

305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala

325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe

340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr

355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr

370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu

385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser

405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn

420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile

435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr

450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly

465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile

485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn

500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys

515 520 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro

530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys

545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys

565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp

580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala

595 600 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser

610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu

625 630 635 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Arg

645 650 655

Leu Leu Leu Leu Gly Pro Gln Ser Leu Cys Gly Ser Ser Met Ile Gln

660 665 670

Asp Thr Arg Asn Val Ile Trp Val Thr Leu Thr Arg Met Asn Arg Ala

675 680 685

Ile Thr

690



<210> 8 

<211> 4523 

<212> DNA 

<213> Homo sapiens 

<400> 8 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040 

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100 

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160
ctgttcatat atggtggtcg gttgctattg ctggggcccc agtcactctg tggatcttct 2220

atgatacagg atacacggaa cgttatatgg gtcaccctga ccagaatgaa cagggctatt 2280 

acttaggatc tgtggccatg caagcagaaa agttcccctc tgaaccaaat cgtttactgc 2340 

tcttacatgg tttcctggat gagaatgtcc attttgcaca taccagtata ttactgagtt 2400 

ttttagtgag ggctggaaag ccatatgatt tacagatcta tcctcaggag agacacagca 2460 

taagagttcc tgaatcggga gaacattatg aactgcatct tttgcactac cttcaagaaa 2520 

accttggatc acgtattgct gctctaaaag tgatataatt ttgacctgtg tagaactctc 2580 

tggtatacac tggctattta accaaatgag gaggtttaat caacagaaaa cacagaattg 2640 

atcatcacat tttgatacct gccatgtaac atctactcct gaaaataaat gtggtgccat 2700 

gcaggggtct acggtttgtg gtagtaatct aataccttaa ccccacatgc tcaaaatcaa 2760 

atgatacata ttcctgagag acccagcaat accataagaa ttactaaaaa aaaaaaaaaa 2820 

aaaaagacat tagcaccatg tattcatact accctatttt cacttttaat agtattataa 2880 

acttcatgaa cttaattagt gtatttttac agtatacttt tgagtttgtt aaaatatgat 2940 

gatattagtg attggtttgg ttcagttcca gaatctttga ctagttacag atttgatagc 3000 

acttaaatgt aattgaatag cttatgcttc attgcttggg catatccagc atgttatgaa 3060 

ctaataacta ttaaacttga cttaaccagt cattcattaa taatttttca aggataactt 3120 

agtggcctcc taaagacact tgttttggca ctgaccagtt tttagccaat ttaatctgta 3180 

tctagtataa ataattctca tttttctttg atgatattaa cagagtgggc ttttcctttt 3240 

gcataaaggc tagtaactgt atatgtagca tggatttaat tagtcatgat attgataatt 3300 

acaggcagaa aatttttaat caaatgatta gagcttaaat atttgcaggc aagttttttt 3360 

ttttccttta agaaaaggaa aaagtacaca ttcactagaa ttcttcagaa aatttagtgg 3420 

tgccagtttc catttggtat ttccttatta aaatattcta gaattttaag gagattgaag 3480 

ggaatcacag tggggtgggg agacctgggt ttggggaatg acagagagaa gaggtggtga 3540 

gggcctgatt aaaaactaag cagaagtagt tttaacaaaa atactcatga aaatgtttgg 3600 

aaactgaaat ttaaacaact gtaatattaa ggaaaccaga atcaataaat cactgtcttg 3660 

ccagcacagc tacagagtaa catgattcag gggaggaaaa gttccttaga gttactttta 3720 

taattctttt tttttttcct cttaggttta gaaatcttac aaatttaaac tttatccttt 3780 

taaaattatt tgaacataat ttagatattg taagcttaaa atacaaatgt ttatagataa 3840 

cctctttacc ataaactaat ccctggcaag ccatggctct cttttttttt ttggtgttta 3900 

aagcctgtaa acagtttttc tgaatgatca tgaacttttc ttggtttagc actaggattt 3960 

agctatgaag agagctcata ggctttcagg tgctaattga gatctgccct gttagagtct 4020 

tggggtgcta gattggtcac attgacacca gtggcaggga aggcatctat gagtttgatg 4080 

ctttttatca cacacttcag tgtttagaaa gttattacca atacttttaa acaacactcc 4140 

aagaaaattt gctatatttc tttctcatca ctacagagag agtagatttc cccatagaga 4200 

gcacagcctc cattagtaag gttggtgact attggtaaga ggtggacttc attgacacca 4260 

agtgggaggt agggaaagcc cagaaatggc aggatgatat ggtggttctg tcgttgggaa 4320 

aggtattggg ttttgctgtt tgtatttata ctgtataata gataccacgc tttttcttat 4380 

tatctgtata tgtattgctt ttcatgtttg atattttccc atgccaagat ttgtttatat 4440 

atattttcaa tgttaaatta aattgatttg ggtaactttc ttccccaaga aagtattttc 4500 

ccccttaagt ataaatctga ctg 4523 

<210> 9 

<211> 241 

<212> PRT 

<213> Homo sapiens 

<400> 9 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr

85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu

100 105 110

WO 02/31134

PCT/US01/31874

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp

115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg

130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly

145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly

165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu

180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp

195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn

210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Gly Lys

225 230 235 240

Ala



<210> 10 

<211> 1356 

<212> DNA 

<213> Homo sapiens 

<400> 10 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatggt aaggcgtagt tcttcagatt tacttttctg 960 

aacagtattt tttgaagtat aatttgctgc ttgcattttg aaattagatt accacgttgg 1020 

gtgatcttta tatttgaaat tcaagtcttt aaaattttta aaaaatggag aaaagtacag 1080 

aggataactt gtatgtacca catgtataat attcatttta atgttttaat gttcattttc 1140 

aaacagtgaa acaaaagaac ctctgacatg attgttcttt tagcttgcta agactgccag 1200 

aattttccca aaactgttct tattaaaata aaattttagg ctaggcatgg tggctcatgc 1260 

ctgtaatcct agcactctgg gaggctgagg caggcagatt gtttgagccc agaagttcaa 1320 

gatcaggatg ggcaacatgg tgacacctcg tttgac 1356 

<210> 11 

<211> 661 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr

85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu

100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp

115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg

130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly

145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly

165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu

180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp

195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn

210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu

225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val

245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys

260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu

275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met

290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr

305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala

325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe

340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr

355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr

370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu

385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser

405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn

420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile

435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr

450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly

465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile

485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn

500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys

515 520 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro

530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys

545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys

565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp

580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala

595 600 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser

610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu

625 630 635 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Leu

645 650 655

Leu Arg Cys Ser Trp

660



<210> 12 

<211> 4829 

<212> DNA 

<213> Homo sapiens 

<400> 12 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040 
cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160 

ctgttcatat atggtggtct cctcaggtgc agttggtgaa taatcggttt aaaggagtca 2220 

agtatttccg cttgaatacc ctagcctctc taggttatgt ggttgtagtg atagacaaca 2280 

ggggatcctg tcaccgaggg cttaaatttg aaggcgcctt taaatataaa atgggtcaaa 2340 

tagaaattga cgatcaggtg gaaggactcc aatatctagc ttctcgatat gatttcattg 2400 

acttagatcg tgtgggcatc cacggctggt cctatggagg atacctctcc ctgatggcat 2460 

taatgcagag gtcagatatc ttcagggttg ctattgctgg ggccccagtc actctgtgga 2520 

tcttctatga tacaggatac acggaacgtt atatgggtca ccctgaccag aatgaacagg 2580 

gctattactt aggatctgtg gccatgcaag cagaaaagtt cccctctgaa ccaaatcgtt 2640 

tactgctctt acatggtttc ctggatgaga atgtccattt tgcacatacc agtatattac 2700 

tgagtttttt agtgagggct ggaaagccat atgatttaca gatctatcct caggagagac 2760 

acagcataag agttcctgaa tcgggagaac attatgaact gcatcttttg cactaccttc 2820 

aagaaaacct tggatcacgt attgctgctc taaaagtgat ataattttga cctgtgtaga 2880 

actctctggt atacactggc tatttaacca aatgaggagg tttaatcaac agaaaacaca 2940 

gaattgatca tcacattttg atacctgcca tgtaacatct actcctgaaa ataaatgtgg 3000 

tgccatgcag gggtctacgg tttgtggtag taatctaata ccttaacccc acatgctcaa 3060 

aatcaaatga tacatattcc tgagagaccc agcaatacca taagaattac taaaaaaaaa 3120 

aaaaaaaaaa agacattagc accatgtatt catactaccc tattttcact tttaatagta 3180 

ttataaactt catgaactta attagtgtat ttttacagta tacttttgag tttgttaaaa 3240 

tatgatgata ttagtgattg gtttggttca gttccagaat ctttgactag ttacagattt 3300 

gatagcactt aaatgtaatt gaatagctta tgcttcattg cttgggcata tccagcatgt 3360 

tatgaactaa taactattaa acttgactta accagtcatt cattaataat ttttcaagga 3420 

taacttagtg gcctcctaaa gacacttgtt ttggcactga ccagttttta gccaatttaa 3480 

tctgtatcta gtataaataa ttctcatttt tctttgatga tattaacaga gtgggctttt 3540 

ccttttgcat aaaggctagt aactgtatat gtagcatgga tttaattagt catgatattg 3600 

ataattacag gcagaaaatt tttaatcaaa tgattagagc ttaaatattt gcaggcaagt 3660 

tttttttttt cctttaagaa aaggaaaaag tacacattca ctagaattct tcagaaaatt 3720 

tagtggtgcc agtttccatt tggtatttcc ttattaaaat attctagaat tttaaggaga 3780 

ttgaagggaa tcacagtggg gtggggagac ctgggtttgg ggaatgacag agagaagagg 3840 

tggtgagggc ctgattaaaa actaagcaga agtagtttta acaaaaatac tcatgaaaat 3900 

gtttggaaac tgaaatttaa acaactgtaa tattaaggaa accagaatca ataaatcact 3960

gtcttgccag cacagctaca gagtaacatg attcagggga ggaaaagttc cttagagtta 4020 

cttttataat tctttttttt tttcctctta ggtttagaaa tcttacaaat ttaaacttta 4080 

tccttttaaa attatttgaa cataatttag atattgtaag cttaaaatac aaatgtttat 4140 

agataacctc tttaccataa actaatccct ggcaagccat ggctctcttt ttttttttgg 4200 

tgtttaaagc ctgtaaacag tttttctgaa tgatcatgaa cttttcttgg tttagcacta 4260 

ggatttagct atgaagagag ctcataggct ttcaggtgct aattgagatc tgccctgtta 4320 

gagtcttggg gtgctagatt ggtcacattg acaccagtgg cagggaaggc atctatgagt 4380 

ttgatgcttt ttatcacaca cttcagtgtt tagaaagtta ttaccaatac ttttaaacaa 4440 

cactccaaga aaatttgcta tatttctttc tcatcactac agagagagta gatttcccca 4500 

tagagagcac agcctccatt agtaaggttg gtgactattg gtaagaggtg gacttcattg 4560 

acaccaagtg ggaggtaggg aaagcccaga aatggcagga tgatatggtg gttctgtcgt 4620 

tgggaaaggt attgggtttt gctgtttgta tttatactgt ataatagata ccacgctttt 4680 

tcttattatc tgtatatgta ttgcttttca tgtttgatat tttcccatgc caagatttgt 4740 

ttatatatat tttcaatgtt aaattaaatt gatttgggta actttcttcc ccaagaaagt 4800 

attttccccc ttaagtataa atctgactg 4829 

<210> 13 

<211> 358 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr

85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu

100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp

115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg

130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly

145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly

165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu

180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp

195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn

210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu

225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val

245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys

260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu

275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met

290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr

305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala

325 330 335

Glu Gly Arg Ser Lys Leu Met Lys Ser Glu Gly Trp Tyr Ile Leu Lys

340 345 350

Ala Pro Lys Thr Pro Leu

355





<210> 14 

<211> 4309 

<212> DNA 

<213> Homo sapiens 

<400> 14 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 
ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaaga tccaagttga tgaagtcaga aggctggtat 1260 

attttgaagg caccaaagac tcccctttag agcatcacct gtacgtagtc agttacgtaa 1320 

atcctggaga ggtgacaagg ctgactgacc gtggctactc acattcttgc tgcatcagtc 1380 

agcactgtga cttctttata agtaagtata gtaaccagaa gaatccacac tgtgtgtccc 1440

tttacaagct atcaagtcct gaagatgacc caacttgcaa aacaaaggaa ttttgggcca 1500 

ccattttgga ttcagcaggt cctcttcctg actatactcc tccagaaatt ttctcttttg 1560 

aaagtactac tggatttaca ttgtatggga tgctctacaa gcctcatgat ctacagcctg 1620 

gaaagaaata tcctactgtg ctgttcatat atggtggtct cctcaggtgc agttggtgaa 1680 

taatcggttt aaaggagtca agtatttccg cttgaatacc ctagcctctc taggttatgt 1740 

ggttgtagtg atagacaaca ggggatcctg tcaccgaggg cttaaatttg aaggcgcctt 1800 

taaatataaa atgggtcaaa tagaaattga cgatcaggtg gaaggactcc aatatctagc 1860 

ttctcgatat gatttcattg acttagatcg tgtgggcatc cacggctggt cctatggagg 1920

atacctctcc ctgatggcat taatgcagag gtcagatatc ttcagggttg ctattgctgg 1980 

ggccccagtc actctgtgga tcttctatga tacaggatac acggaacgtt atatgggtca 2040 

ccctgaccag aatgaacagg gctattactt aggatctgtg gccatgcaag cagaaaagtt 2100 

cccctctgaa ccaaatcgtt tactgctctt acatggtttc ctggatgaga atgtccattt 2160 

tgcacatacc agtatattac tgagtttttt agtgagggct ggaaagccat atgatttaca 2220 

gatctatcct caggagagac acagcataag agttcctgaa tcgggagaac attatgaact 2280 

gcatcttttg cactaccttc aagaaaacct tggatcacgt attgctgctc taaaagtgat 2340 

ataattttga cctgtgtaga actctctggt atacactggc tatttaacca aatgaggagg 2400 

tttaatcaac agaaaacaca gaattgatca tcacattttg atacctgcca tgtaacatct 2460 

actcctgaaa ataaatgtgg tgccatgcag gggtctacgg tttgtggtag taatctaata 2520 

ccttaacccc acatgctcaa aatcaaatga tacatattcc tgagagaccc agcaatacca 2580 

taagaattac taaaaaaaaa aaaaaaaaaa agacattagc accatgtatt catactaccc 2640 

tattttcact tttaatagta ttataaactt catgaactta attagtgtat ttttacagta 2700 

tacttttgag tttgttaaaa tatgatgata ttagtgattg gtttggttca gttccagaat 2760 

ctttgactag ttacagattt gatagcactt aaatgtaatt gaatagctta tgcttcattg 2820 

cttgggcata tccagcatgt tatgaactaa taactattaa acttgactta accagtcatt 2880 

cattaataat ttttcaagga taacttagtg gcctcctaaa gacacttgtt ttggcactga 2940 

ccagttttta gccaatttaa tctgtatcta gtataaataa ttctcatttt tctttgatga 3000 

tattaacaga gtgggctttt ccttttgcat aaaggctagt aactgtatat gtagcatgga 3060 

tttaattagt catgatattg ataattacag gcagaaaatt tttaatcaaa tgattagagc 3120 

ttaaatattt gcaggcaagt tttttttttt cctttaagaa aaggaaaaag tacacattca 3180 

ctagaattct tcagaaaatt tagtggtgcc agtttccatt tggtatttcc ttattaaaat 3240 

attctagaat tttaaggaga ttgaagggaa tcacagtggg gtggggagac ctgggtttgg 3300 

ggaatgacag agagaagagg tggtgagggc ctgattaaaa actaagcaga agtagtttta 3360 

acaaaaatac tcatgaaaat gtttggaaac tgaaatttaa acaactgtaa tattaaggaa 3420

accagaatca ataaatcact gtcttgccag cacagctaca gagtaacatg attcagggga 3480 

ggaaaagttc cttagagtta cttttataat tctttttttt tttcctctta ggtttagaaa 3540 

tcttacaaat ttaaacttta tccttttaaa attatttgaa cataatttag atattgtaag 3600 

cttaaaatac aaatgtttat agataacctc tttaccataa actaatccct ggcaagccat 3660 

ggctctcttt ttttttttgg tgtttaaagc ctgtaaacag tttttctgaa tgatcatgaa 3720 

cttttcttgg tttagcacta ggatttagct atgaagagag ctcataggct ttcaggtgct 3780 

aattgagatc tgccctgtta gagtcttggg gtgctagatt ggtcacattg acaccagtgg 3840 

cagggaaggc atctatgagt ttgatgcttt ttatcacaca cttcagtgtt tagaaagtta 3900 

ttaccaatac ttttaaacaa cactccaaga aaatttgcta tatttctttc tcatcactac 3960 

agagagagta gatttcccca tagagagcac agcctccatt agtaaggttg gtgactattg 4020 

gtaagaggtg gacttcattg acaccaagtg ggaggtaggg aaagcccaga aatggcagga 4080 

tgatatggtg gttctgtcgt tgggaaaggt attgggtttt gctgtttgta tttatactgt 4140 

ataatagata ccacgctttt tcttattatc tgtatatgta ttgcttttca tgtttgatat 4200 

tttcccatgc caagatttgt ttatatatat tttcaatgtt aaattaaatt gatttgggta 4260 

actttcttcc ccaagaaagt attttccccc ttaagtataa atctgactg 4309 
<210> 15

<211> 108 

<212> PRT 

<213> Homo sapiens 

<400> 15 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Gly Asn Lys Ser Leu Ile Asp His Asp Arg

85 90 95

Phe Ser Lys Ser Lys Met Pro Glu Ile Ala Ser Ser

100 105

<210> 16 

<211> 620 

<212> DNA

<213> Homo sapiens 

<400> 16 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tggtaacaag 480 

tcattaattg atcatgatcg tttttcaaaa tcgaagatgc cagaaattgc ttcttcctaa 540 

agctagcttg aaatgccttt ctttagatgg tctgattagg aaaacaaaca ataaaaccat 600 

tagtttgttc ccactcaaca 620 

<210> 17 

<211> 194 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr

85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu

100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp

115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg

130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly

145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly

165 170 175

Gly Pro Gln Gly Phe Thr Gln Pro Leu Arg Pro Asn Leu Val Glu Thr

180 185 190

Cys Ala







<210> 18 

<211> 832 

<212> DNA 

<213> Homo sapiens 

<400> 18 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc wacaaccttt aaggcccaat 780 

ctagtggaaa ctasttgtsc caracytgca tgacccaatc agatcctgta ga 832 

<210> 19 

<211> 658 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu

1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu

20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu

35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro

50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser

65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr

85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu

100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp

115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg

130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly

145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly

165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu

180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp

195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn

210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu

225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val

245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys

260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu

275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met

290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr

305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala

325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe

340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr

355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr

370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu

385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser

405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn

420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile

435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr

450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly

465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile

485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn

500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys

515 520 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro

530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys

545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys

565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp

580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala

595 600 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser

610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu

625 630 635 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Arg

645 650 655

Val Lys



<210> 20 

<211> 4676 

<212> DNA 

<213> Homo sapiens 

<400> 20 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040 

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100 

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160 

ctgttcatat atggtggtcg ggtcaaatag aaattgacga tcaggtggaa ggactccaat 2220 

atctagcttc tcgatatgat ttcattgact tagatcgtgt gggcatccac ggctggtcct 2280 

atggaggata cctctccctg atggcattaa tgcagaggtc agatatcttc agggttgcta 2340 

ttgctggggc cccagtcact ctgtggatct tctatgatac aggatacacg gaacgttata 2400 

tgggtcaccc tgaccagaat gaacagggct attacttagg atctgtggcc atgcaagcag 2460 

aaaagttccc ctctgaacca aatcgtttac tgctcttaca tggtttcctg gatgagaatg 2520 

tccattttgc acataccagt atattactga gttttttagt gagggctgga aagccatatg 2580 

atttacagat ctatcctcag gagagacaca gcataagagt tcctgaatcg ggagaacatt 2640 

atgaactgca tcttttgcac taccttcaag aaaaccttgg atcacgtatt gctgctctaa 2700 

aagtgatata attttgacct gtgtagaact ctctggtata cactggctat ttaaccaaat 2760 

gaggaggttt aatcaacaga aaacacagaa ttgatcatca cattttgata cctgccatgt 2820 

aacatctact cctgaaaata aatgtggtgc catgcagggg tctacggttt gtggtagtaa 2880 

tctaatacct taaccccaca tgctcaaaat caaatgatac atattcctga gagacccagc 2940 

aataccataa gaattactaa aaaaaaaaaa aaaaaaaaga cattagcacc atgtattcat 3000 

actaccctat tttcactttt aatagtatta taaacttcat gaacttaatt agtgtatttt 3060 

tacagtatac ttttgagttt gttaaaatat gatgatatta gtgattggtt tggttcagtt 3120 

ccagaatctt tgactagtta cagatttgat agcacttaaa tgtaattgaa tagcttatgc 3180 

ttcattgctt gggcatatcc agcatgttat gaactaataa ctattaaact tgacttaacc 3240 

agtcattcat taataatttt tcaaggataa cttagtggcc tcctaaagac acttgttttg 3300 

gcactgacca gtttttagcc aatttaatct gtatctagta taaataattc tcatttttct 3360 

ttgatgatat taacagagtg ggcttttcct tttgcataaa ggctagtaac tgtatatgta 3420 

gcatggattt aattagtcat gatattgata attacaggca gaaaattttt aatcaaatga 3480 

ttagagctta aatatttgca ggcaagtttt tttttttcct ttaagaaaag gaaaaagtac 3540 

acattcacta gaattcttca gaaaatttag tggtgccagt ttccatttgg tatttcctta 3600 

ttaaaatatt ctagaatttt aaggagattg aagggaatca cagtggggtg gggagacctg 3660 

ggtttgggga atgacagaga gaagaggtgg tgagggcctg attaaaaact aagcagaagt 3720 

agttttaaca aaaatactca tgaaaatgtt tggaaactga aatttaaaca actgtaatat 3780 

taaggaaacc agaatcaata aatcactgtc ttgccagcac agctacagag taacatgatt 3840 

caggggagga aaagttcctt agagttactt ttataattct tttttttttt cctcttaggt 3900 

ttagaaatct tacaaattta aactttatcc ttttaaaatt atttgaacat aatttagata 3960 

ttgtaagctt aaaatacaaa tgtttataga taacctcttt accataaact aatccctggc 4020 

aagccatggc tctctttttt tttttggtgt ttaaagcctg taaacagttt ttctgaatga 4080 

tcatgaactt ttcttggttt agcactagga tttagctatg aagagagctc ataggctttc 4140 

aggtgctaat tgagatctgc cctgttagag tcttggggtg ctagattggt cacattgaca 4200 

ccagtggcag ggaaggcatc tatgagtttg atgcttttta tcacacactt cagtgtttag 4260 

aaagttatta ccaatacttt taaacaacac tccaagaaaa tttgctatat ttctttctca 4320 

tcactacaga gagagtagat ttccccatag agagcacagc ctccattagt aaggttggtg 4380 

actattggta agaggtggac ttcattgaca ccaagtggga ggtagggaaa gcccagaaat 4440 

ggcaggatga tatggtggtt ctgtcgttgg gaaaggtatt gggttttgct gtttgtattt 4500 

atactgtata atagatacca cgctttttct tattatctgt atatgtattg cttttcatgt 4560 

ttgatatttt cccatgccaa gatttgttta tatatatttt caatgttaaa ttaaattgat 4620 

ttgggtaact ttcttcccca agaaagtatt ttccccctta agtataaatc tgactg 4676 

<210> 21 

<211> 613 

<212> PRT 

<213> Homo sapiens 

<400> 21 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu 
1 5 10 15 

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu 
20 25 30 

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu 
35 40 45 

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro 
50 55 60 

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser 
65 70 75 80 

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr 
85 90 95 

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu 
100 105 110 

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp 
115 120 125 

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg 
130 135 140 

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly 
145 150 155 160 

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly 
165 170 175 

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu 
180 185 190 

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp 
195 200 205 

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn 
210 215 220 

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu 
225 230 235 240 

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val 
245 250 255 

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys 
260 265 270 

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu 
275 280 285 

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met 
290 295 300 

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr 
305 310 315 320 

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala 
325 330 335 

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe 
340 345 350 

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr 
355 360 365 

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr 
370 375 380 

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu 
385 390 395 400 

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser 
405 410 415 

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn 
420 425 430 

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile 
435 440 445 

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr 
450 455 460 

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly 
465 470 475 480 

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile 
485 490 495 

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn 
500 505 510 

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys 
515 520 525 

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro 
530 535 540 

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys 
545 550 555 560 

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys 
565 570 575 

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp 
580 585 590 

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Val 
595 600 605 

Leu Arg Cys Ser Trp 
610 

<210> 22 

<211> 4685 

<212> DNA 

<213> Homo sapiens 

<400> 22 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagtcctc 2040 

aggtgcagtt ggtgaataat cggtttaaag gagtcaagta tttccgcttg aataccctag 2100 

cctctctagg ttatgtggtt gtagtgatag acaacagggg atcctgtcac cgagggctta 2160 

aatttgaagg cgcctttaaa tataaaatgg gtcaaataga aattgacgat caggtggaag 2220 

gactccaata tctagcttct cgatatgatt tcattgactt agatcgtgtg ggcatccacg 2280 

gctggtccta tggaggatac ctctccctga tggcattaat gcagaggtca gatatcttca 2340 

gggttgctat tgctggggcc ccagtcactc tgtggatctt ctatgataca ggatacacgg 2400 

aacgttatat gggtcaccct gaccagaatg aacagggcta ttacttagga tctgtggcca 2460 

tgcaagcaga aaagttcccc tctgaaccaa atcgtttact gctcttacat ggtttcctgg 2520 

atgagaatgt ccattttgca cataccagta tattactgag ttttttagtg agggctggaa 2580 

agccatatga tttacagatc tatcctcagg agagacacag cataagagtt cctgaatcgg 2640 

gagaacatta tgaactgcat cttttgcact accttcaaga aaaccttgga tcacgtattg 2700 

ctgctctaaa agtgatataa ttttgacctg tgtagaactc tctggtatac actggctatt 2760 

taaccaaatg aggaggttta atcaacagaa aacacagaat tgatcatcac attttgatac 2820 
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ctgccatgta acatctactc ctgaaaataa 
tggtagtaat ctaatacctt aaccccacat 
agacccagca ataccataag aattactaaa 
tgtattcata ctaccctatt ttcactttta 
gtgtattttt acagtatact tttgagtttg 
ggttcagttc cagaatcttt gactagttac 
agcttatgct tcattgcttg ggcatatcca 
gacttaacca gtcattcatt aataattttt 
cttgttttgg cactgaccag tttttagcca 
catttttctt tgatgatatt aacagagtgg 
gtatatgtag catggattta attagtcatg 
atcaaatgat tagagcttaa atatttgcag 
aaaaagtaca cattcactag aattcttcag 
atttccttat taaaatattc tagaatttta 
ggagacctgg gtttggggaa tgacagagag 
agcagaagta gttttaacaa aaatactcat 
ctgtaatatt aaggaaacca gaatcaataa 
aacatgattc aggggaggaa aagttcctta 
ctcttaggtt tagaaatctt acaaatttaa 
atttagatat tgtaagctta aaatacaaat 
atccctggca agccatggct ctcttttttt 
tctgaatgat catgaacttt tcttggttta 
taggctttca ggtgctaatt gagatctgcc 
acattgacac cagtggcagg gaaggcatct 
agtgtttaga aagttattac caatactttt 
tctttctcat cactacagag agagtagatt 
aggttggtga ctattggtaa gaggtggact 
cccagaaatg gcaggatgat atggtggttc 
tttgtattta tactgtataa tagataccac 
ttttcatgtt tgatattttc ccatgccaag 
taaattgatt tgggtaactt tcttccccaa 
gactg 

<210> 23 

<211> 892 

<212> PRT 

<213> Homo sapiens 

<400> 23 



atgtggtgcc atgcaggggt ctacggtttg 2880 

gctcaaaatc aaatgataca tattcctgag 2940 

aaaaaaaaaa aaaaaaagac attagcacca 3000 

atagtattat aaacttcatg aacttaatta 3060 

ttaaaatatg atgatattag tgattggttt 3120 

agatttgata gcacttaaat gtaattgaat 3180 

gcatgttatg aactaataac tattaaactt 3240 

caaggataac ttagtggcct cctaaagaca 3300 

atttaatctg tatctagtat aaataattct 3360 

gcttttcctt ttgcataaag gctagtaact 3420 

atattgataa ttacaggcag aaaattttta 3480 

gcaagttttt ttttttcctt taagaaaagg 3540 

aaaatttagt ggtgccagtt tccatttggt 3600 

aggagattga agggaatcac agtggggtgg 3660 

aagaggtggt gagggcctga ttaaaaacta 3720 

gaaaatgttt ggaaactgaa atttaaacaa 3780 

atcactgtct tgccagcaca gctacagagt 3840 

gagttacttt tataattctt tttttttttc 3900 

actttatcct tttaaaatta tttgaacata 3960 

gtttatagat aacctcttta ccataaacta 4020 

ttttggtgtt taaagcctgt aaacagtttt 40B0 

gcactaggat ttagctatga agagagctca 4140 

ctgttagagt cttggggtgc tagattggtc 4200 

atgagtttga tgctttttat cacacacttc 4260 

aaacaacact ccaagaaaat ttgctatatt 4320 

tccccataga gagcacagcc tccattagta 4380 

tcattgacac caagtgggag gtagggaaag 4440 

tgtcgttggg aaaggtattg ggttttgctg 4500 

gctttttctt attatctgta tatgtattgc 4560 

atttgtttat atatattttc aatgttaaat 4620 

gaaagtattt tcccccttaa gtataaatct 4680 
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<210> 24 

<211> 4302 

<212> DNA 

<213> Homo sapiens 

<400> 24 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 
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ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gtggccatcg cgggtgcccc ggtcaccgtc 2640 

tggatggcct acgacacagg gtacactgag cgctacatgg acgtccctga gaacaaccag 2700 

cacggctatg aggcgggttc cgtggccctg cacgtggaga agctgcccaa tgagcccaac 2760 

cgcttgctta tcctccacgg cttcctggac gaaaacgtgc actttttcca cacaaacttc 2820 

ctcgtctccc aactgatccg agcagggaaa ccttaccagc tccagatcta ccccaacgag 2880 

agacacagta ttcgctgccc cgagtcgggc gagcactatg aagtcacgtt gctgcacttt 2940 

ctacaggaat acctctgagc ctgcccaccg ggagccgcca catcacagca caagtggctg 3000 

cagcctccgc ggggaaccag gcgggaggga ctgagtggcc cgcgggcccc agtgaggcac 3060 

tttgtcccgc ccagcgctgg ccagccccga ggagccgctg ccttcaccgc cccgacgcct 3120 

tttatccttt tttaaacgct cttgggtttt atgtccgctg cttcttggtt gccgagacag 3180 

agagatggtg gtctcgggcc agcccctcct ctccccgcct tctgggagga ggaggtcaca 3240 

cgctgatggg cactggagag gccagaagag actcagagga gcgggctgcc ttccgcctgg 3300 

ggctccctgt gacctctcag tcccctggcc cggccagcca ccgtccccag cacccaagca 3360 

tgcaattgcc tgtccccccc ggccagcctc cccaacttga tgtttgtgtt ttgtttgggg 3420 

ggatattttt cataattatt taaaagacag gccgggcgcg gtggctcacg tctgtaatcc 3480 

cagcactttg ggaggctgag gcgggcggat cacctgaggt tgggagttca agaccagcct 3540 

ggccaacatg gggaaacccc gtctctacta aaaatacaaa aaattagccg ggtgtggtgg 3600 

cgcgtgccta taatcccagc tactcgggag gctgaggcag gagaatcgct tgaacccggg 3660 

aggtggaggt tgcggtgagc caagatcgca ccattgcact ccagcctggg caacaagagc 3720 

gaaactctgt ctcaaaataa ataaaaaata aaagacagaa agcaaggggt gcctaaatct 3780 

agacttgggg tccacaccgg gcagcggggt tgcaacccag cacctggtag gctccatttc 3840 

ttcccaagcc cgagcagagg gtcatgcggg ccccacagga gaagcggcca gggcccgcgg 3900 

ggggcaccac ctgtggacag ccctcctgtc cccaagcttt caggcaggca ctgaaacgca 3960 

ccgaacttcc acgctctgct ggtcagtggc ggctgtcccc tccccagccc agccgcccag 4020 

ccacatgtgt ctgcctgacc cgtacacacc aggggttccg gggttgggag ctgaaccatc 4080 

cccacctcag ggttatattt ccctctcccc ttccctcccc gccaagagct ctgccagggg 4140 

cgggcaaaaa aaaaagtaaa aagaaaagaa aaaaaaaaaa aagaaacaaa ccacctctac 4200 

atattatgga aagaaaatat ttttgtcgat tcttattctt ttataattat gcgtggaaga 4260 

agtagacaca ttaaacgatt ccagttggaa acatgtcacc tg 4302 

<210> 25 

<211> 518 

<212> PRT 

<213> Homo sapiens 

<400> 25 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
1 5 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 
20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys 
65 70 75 80 

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro 
85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 
100 105 110 

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala 
115 120 125 

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr 
130 135 140 

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg 
145 150 155 160 

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu 
165 170 175 

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg 
180 185 190 

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu 
195 200 205 

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro 
210 215 220 

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val 
225 230 235 240 

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln 
245 250 255 

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr 
260 265 270 

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys 
275 280 285 

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile 
290 295 300 

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro 
305 310 315 320 

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg 
325 330 335 

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln 
340 345 350 

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val 
355 360 365 

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala 
370 375 380 

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg 
385 390 395 400 

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile 
405 410 415 

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val 
420 425 430 

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val 
435 440 445 

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly 
450 455 460 

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe 
465 470 475 480 

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp 
485 490 495 

Trp Ser Glu Pro Phe Ser Pro Gly Glu Gly Glu Gln Ser Leu Thr Asn 
500 505 510 

Ala Val Asp Ser Ser Arg 
515 



<210> 26 

<211> 2411 

<212> DNA 

<213> Homo sapiens 

<400> 26 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 
tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 
tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 
cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 
gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 
cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 
gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 
gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 
cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 
gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 
tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 
gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 
ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 
atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 
gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 
cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 
gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 
tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 
gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 
ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 
gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaaggtgag 
cagagcctga cgaatgctgt cgactcatcg cgttagtcac gtgtggttca atatgctgtt 
tgttcattgg tcggcccccc cactcagcca gcacaccctg cgggagaagg aacagggatc 
ggcaggaagc cagccttccc cagtgactgc atgatctggc agggcttaga gcacccaact 
gttggcttat tcaggcagca gatttactga gcacctcccc tgtgccaggc ccttagcaca 
accaggggtt ggccacctac ggcccacagg tcaaatccgg cccaccacct gtgttcataa 
ataaagtttt attggcactg agccacagcc acttgtttac agagactgtc tgtggtcgct 
tttgtgctgc agcagcagaa ctgggtagtc ccagcagaaa ctgttgtgca aggccaagat 
ttactgtcta gccctttgta gaaacatttg ccagctcctg ctgtaggtag ctgtgatgga 
attgttcact gtaaataaag aaaaaggaaa atccctgctc ttgggacctt ctagtggagg 
aggcagtatt ccagaaacag ttagaggtgc tgcctctggt gtgctgtggg tggcagatgc 
agatcctagt c 

<210> 27 

<211> 892 

<212> PRT 

<213> Homo sapiens 

<400> 27 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
1 5 10 15 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2411 
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Trp 


Arg 


Ser 


Phe 








20 


Thr 


Gly 


Thr 


Pro 






35 




Pro 


Ala 


Ala 


Arg 




50 






Ser 


He 


He 


His 


65 








Ala 


Pro 


His 


Asp 


His 


Ser 


His 


Arg 








100 


Asn 


Ser 


Leu 


Leu 






115 




Leu 


Leu 


Leu 


Leu 




130 






Pro 


His 


His 


Gly 


145 








Lys 


Arg 


Leu 


Gly 


Ser 


Gly 


Leu 


Phe 








180 


ASD 


Glv 


Glv 


LVS 






195 




He 


Lys 


Thr 


Gin 




210 






Ala 


Asp 


Pro 


Ala 


225 








Ala 


Asn 


He 


Glu 


Gly 


Leu 


Ser 


Asn 








260 


Phe 


Val 


He 


Gin 






275 




Pro 


Thr 


Ala 


Ser 




290 






Leu 


Tyr 


GlU 


Glu 


305 








Ser 


Pro 


Ala 


Leu 


Thr 


Gly 


Ser 


Lys 








340 


Thr 


Asp 


Ser 


Gin 






355 




Gin 


Pro 


Phe 


Ser 




370 






Gly 


Trp 


Thr 


Arg 


385 








Pro 


Gin 


Gin 


Trp 


Pro 


Ser 


Thr 


Glu 








420 


Pro 


Arg 


Asn 


Val 






435 




Trp 


He 


Asn 


Val 




450 






Glu 


Asp 


Glu 


Leu 


465 








Cys 


His 


Leu 


Tyr 



Ser 


Leu 


Asn 


Ser 


Thr 


Ala 


Asp 


Arg 








40 


Phe 


Gin 


Val 


Gin 






55 




Gly 


Ser 


Arg 


Lys 




70 






Phe 


Gin 


Phe 


Val 


85 








Leu 


Tyr 


Tyr 


Leu 


Tyr 


Ser 


Glu 


He 








120 


Ser 


Tro 


Lvs 


Gin 






135 




Val 


Tyr 


Ser 


Arg 




150 






Val 


Phe 


Gly 


He 


165 








Leu 


Phe 


Gin 


Ala 


Asn 


Gly 


Phe 


Met 








200 


Cvs 


Ser 


Glv 


Pro 






215 




Phe 


Phe 


Ser 


Phe 




230 






Thr 


Gly 


Glu 


Glu 


245 








Val 


Leu 


Asp 


Asp 


Glu 


Glu 


Phe 


Asp 








280 


Tro 


Glu 


Glv 


Ser 






295 




Val 


Asp 


Glu 


Ser 




310 






Glu 


Glu 


Arg 


Lys 


325 








Asn 


Pro 


Lys 


He 


Gly 


Lys 


He 


Val 








360 


Ser 


Leu 


Phe 


Pro 






375 




Asp 


Gly 


Lys 


Tyr 




390 






Leu 


Gin 


Leu 


Val 


405 








Asn 


Glu 


Glu 


Gin 


Gin 


Pro 


Tyr 


Val 








440 


His 


Asp 


He 


Phe 






455 




Cys 


Phe 


Leu 


Arg 




470 






Lys 


Val 


Thr 


Ala 



485 



Glu 


Gly 


Ala 


Glu 


25 








Gly 


Asp 


Ala 


Ala 


Lys 


••J _ 

His 


Ser 


Trp 








60 


Tyr 


Ser 


Gly 


Leu 






75 




Gin 


Lys 


Thr 


Asp 




90 






Gly 


Met 


Pro 


Tyr 


105 








Pro 


Lys 


Lys 


Val 


Met 


Leu 


Asp 


His 








140 


Glu 


GlU 


Glu 


Leu 






155 




Thr 


Ser 


Tyr 


Asp 




170 






Ser 


Asn 


Ser 


Leu 


185 








Val 


Ser 


Pro 


Met 


Arg 


Met 


Asp 


Pro 








220 


He 


Asn 


Asn 


Ser 






235 




Arg 


Arg 


Leu 


Thr 




250 






Pro 


Lys 


Ser 


Ala 


265 








Arg 


Phe 


Thr 


Gly 


Glu 


Gly 


Leu 


Lys 








300 


Glu 


Val 


Glu 


Val 






315 




Thr 


Asp 


Ser 


Tyr 




330 






Ala 


Leu 


Lys 


Leu 


345 








Ser 


Thr 


Gin 


Glu 


Lys 


Val 


Glu 


Tyr 








380 


Ala 


Trp 


Ala 


Met 






395 




Leu 


Leu 


Pro 


Pro 




410 






Arg 


Leu 


Ala 


Ser 


425 








Val 


Tyr 


Glu 


GlU 


Tyr 


Pro 


Phe 


Pro 








460 


Ala 


Asn 


Glu 


Cys 






475 




Val 


Leu 


Lys 


Ser 



490 



Arg 


Met Ala 


Thr 




30 




Ala 


Thr Asp 


Asp 


45 






Asp 


Gly Leu 


Arg 


He 


Val Asn 


Lys 






80 


Glu 


Ser Gly 


Pro 




95 




Gly 


Ser Arg 


Glu 




110 




Arg 


Lys Glu 


Ala 


125 






Phe 


Gin Ala 


Thr 


Leu 


Arg Glu 


Arg 






160 


Phe 


His Ser 


Glu 




175 




Phe 


His Cys 


Arg 




190 




Lys 


Pro Leu 


Glu 


205 






Lys 


He Cys 


Pro 


Asp 


Leu Trp 


Val 






240 


Phe 


Cys His 


Gin 




255 




Gly 


Val Ala 


Thr 




270 




Tyr 


Trp Trp 


Cys 


285 






Thr 


Leu Arg 


He 


He 


His Val 


Pro 






320 


Arg 


Tyr Pro 


Arg 




335 




Ala 


Glu Phe 


Gin 




350 




Lys 


Glu Leu 


Val 


365 






He 


Ala Arg 


Ala 


Phe 


Leu Asp 


Arg 






400 


Ala 


Leu Phe 


He 




415 




Ala 


Arg Ala 


Val 




430 




Val 


Thr Asn 


Val 


445 






Gin 


Ser Glu 


Gly 


Lys 


Thr Gly 


Phe 






480 


Gln


Gly Tyr 


Asp 




495

Trp


Ser 


Glu 


Pro 
500 


Phe 


Ser 


Pro 


Gly Glu Asp 
505 


Glu Phe 


Lys 


Cys 
510 


Pro 


Ile


Lys 


Glu 


Glu 
515 


Ile


Ala 


Leu 


Thr 


Ser Gly Glu 
520 


Trp Glu 


Val 
525 


Leu 


Ala 


Arg 


His 


Gly 
530 


Ser 


Lys 


Ile


Trp 


Val 
535 


Asn Glu Glu 


Thr Lys 
540 


Leu 


Val


Tyr 


Phe 


Gln


Gly 


Thr 


Lys 


Asp 


Thr 


Pro 


Leu Glu His 


His Leu 


Tyr 


Val 


Val 


Ser 


545 










550 






555 








560 


Tyr 


Glu 


Ala 


Ala 


Gly 
565 


Glu 


Ile


Val Arg Leu 

570 


Thr Thr 


Pro 


Gly 


Phe 
575 


Ser 


His 


Ser 


Cys 


Ser 
580 


Met 


Ser 


Gln


Asn Phe Asp 
585 


Met Phe 


Val 


Ser 
590 


His 


Tyr 


Ser


Ser 


Val
595 


Ser 


Thr 


Pro 


Pro 


Cys Val His 
600 


Val Tyr 


Lys 
605 


Leu 


Ser 


Gly 


Pro 


Asp 
610 


Asp 


Asp 


Pro 


Leu 


His 
615 


Lys Gln Pro


Arg Phe 
620 


Trp 


Ala 


Ser 


Met 


Met 


Glu 


Ala 


Ala 


Ser 


Cys 


Pro 


Pro Asp Tyr 


Val Pro 


Pro 


Glu 


Ile


Phe 


625 










630 






635 








640 


His 


Phe 


His 


Thr 


Arg 
645 


Ser 


Asp 


Val Arg Leu 
650 


Tyr Gly 


Met 


Ile


Tyr 
655 


Lys 


Pro 


His


Ala 


Leu 
660 


Gln


Pro 


Gly 


Lys Lys His 
665 


Pro Thr 


Val 


Leu 
670 


Phe 


Val 


Tyr 


Gly 


Gly 
675 


Pro 


Gln


Val 


Gln


Leu Val Asn 
680 


Asn Ser 


Phe 
685 


Lys 


Gly 


Ile


Lys 


Tyr 
690 


Leu 


Arg 


Leu 


Asn 


Thr 
695 


Leu Ala Ser 


Leu Gly 
700 


Tyr 


Ala 


Val 


Val 


Val


Ile


Asp 


Gly 


Arg 


Gly 


Ser 


Cys Gln Arg


Gly Leu 


Arg 


Phe 


Glu 


Gly 


705 










710 






715 








720 


Ala 


Leu 


Lys 


Asn 


Gln
725 


Met 


Gly 


Gln Val Glu
730 


Ile Glu


Asp 


Gln


Val 
735 


Glu 


Gly 


Leu 


Gln


Phe 
740 


Val 


Ala 


Glu 


Lys Tyr Gly 
745 


Phe Ile


Asp 


Leu 
750 


Ser 


Arg 


Val 


Ala 


Ile
755 


His 


Gly 


Trp 


Ser 


Tyr Gly Gly 
760 


Phe Leu 


Ser 
765 


Leu 


Met 


Gly 


Leu 


Ile
770 


His 


Lys 


Pro 


Gln


Val 
775 


Phe Lys Val 


Ala Ile
780 


Ala 


Gly 


Ala 


Pro 


Val 


Thr 


Val 


Trp 


Met 


Ala 


Tyr 


Asp Thr Gly 


Tyr Thr 


Glu 


Arg 


Tyr 


Met 


785 










790 






795 








800 


Asp 


Val


Pro 


Glu 


Asn 
805 


Asn 


Gln


His Gly Tyr 
810 


Glu Ala 


Gly 


Ser 


Val 
815 


Ala 


Leu 


His 


Val 


Glu 
820 


Lys 


Leu 


Pro 


Asn Glu Pro 
825 


Asn Arg 


Leu 


Leu 
830 


Ile


Leu 


His 


Gly 


Phe 
835 


Leu 


Asp 


Glu 


Asn 


Val His Phe 
840 


Phe His 


Thr 
845 


Asn 


Phe 


Leu 


Val 


Ser 
850 


Gln


Leu 


Ile


Arg 


Ala 
855 


Gly Lys Pro 


Tyr Gln
860 


Leu 


Gln


Ile


Tyr

Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu
865 870 875 

Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr




885 890 



<210> 28 

<211> 4219 

<212> DNA 

<213> Homo sapiens 

<400> 28 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gtggccatcg cgggtgcccc ggtcaccgtc 2640 

tggatggcct acgacacagg gtacactgag cgctacatgg acgtccctga gaacaaccag 2700 

cacggctatg aggcgggttc cgtggccctg cacgtggaga agctgcccaa tgagcccaac 2760 

cgcttgctta tcctccacgg cttcctggac gaaaacgtgc actttttcca cacaaacttc 2820 

ctcgtctccc aactgatccg agcagggaaa ccttaccagc tccagatcta ccccaacgag 2880 

agacacagta ttcgctgccc cgagtcgggc gagcactatg aagtcacgtt gctgcacttt 2940

ctacaggaat acctctgagc ctgcccaccg ggagccgcca catcacagca caagtggctg 3000

cagcctccgc ggggaaccag gcgggaggga ctgagtggcc cgcgggcccc agtgaggcac 3060 

tttgtcccgc ccagcgctgg ccagccccga ggagccgctg ccttcaccgc cccgacgcct 3120 

tttatccttt tttaaacgct cttgggtttt atgtccgctg cttcttggtt gccgagacag 3180 

agagatggtg gtctcgggcc agcccctcct ctccccgcct tctgggagga ggaggtcaca 3240

cgctgatggg cactggagag gccagaagag actcagagga gcgggctgcc ttccgcctgg 3300 

ggctccctgt gacctctcag tcccctggcc cggccagcca ccgtccccag cacccaagca 3360 

tgcaattgcc tgtccccccc ggccagcctc cccaacttga tgtttgtgtt ttgtttgggg 3420 

ggatattttt cataattatt taaaagacag gccgggcgcg gtggctcacg tctgtaatcc 3480 

cagcactttg ggaggctgag gcgggcggat cacctgaggt tgggagttca agaccagcct 3540 

ggccaacatg gggaaacccc gtctctacta aaaatacaaa aaattagccg ggtgtggtgg 3600 

cgcgtgccta taatcccagc tactcgggag gctgaggcag gagaatcgct tgaacccggg 3660 

aggtggaggt tgcggtgagc caagatcgca ccattgcact ccagcctggg caacaagagc 3720 

gaaactctgt ctcaaaataa ataaaaaata aaagacagaa agcaaggggt gcctaaatct 3780 

agacttgggg tccacaccgg gcagcggggt tgcaacccag cacctggtag gctccatttc 3840 

ttcccaagcc cgactttcag gcaggcactg aaacgcaccg aacttccacg ctctgctggt 3900 

cagtggcggc tgtcccctcc ccagcccagc cgcccagcca catgtgtctg cctgacccgt 3960 

acacaccagg ggttccgggg ttgggagctg aaccatcccc acctcagggt tatatttccc 4020 

tctccccttc cctccccgcc aagagctctg ccaggggcgg gcaaaaaaaa aagtaaaaag 4080 

aaaagaaaaa aaaaaaaaag aaacaaacca cctctacata ttatggaaag aaaatatttt 4140 

tgtcgattct tattctttta taattatgcg tggaagaagt agacacatta aacgattcca 4200 

gttggaaaca tgtcacctg 4219

<210> 30

<211> 4159 

<212> DNA

<213> Homo sapiens 

<400> 30 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gcccaaccgc ttgcttatcc tccacggctt 2640 

cctggacgaa aacgtgcact ttttccacac aaacttcctc gtctcccaac tgatccgagc 2700 

agggaaacct taccagctcc agatctaccc caacgagaga cacagtattc gctgccccga 2760 

gtcgggcgag cactatgaag tcacgttgct gcactttcta caggaatacc tctgagcctg 2820 

cccaccggga gccgccacat cacagcacaa gtggctgcag cctccgcggg gaaccaggcg 2880 

ggagggactg agtggcccgc gggccccagt gaggcacttt gtcccgccca gcgctggcca 2940 

gccccgagga gccgctgcct tcaccgcccc gacgcctttt atcctttttt aaacgctctt 3000 

gggttttatg tccgctgctt cttggttgcc gagacagaga gatggtggtc tcgggccagc 3060 

ccctcctctc cccgccttct gggaggagga ggtcacacgc tgatgggcac tggagaggcc 3120 

agaagagact cagaggagcg ggctgccttc cgcctggggc tccctgtgac ctctcagtcc 3180 

cctggcccgg ccagccaccg tccccagcac ccaagcatgc aattgcctgt cccccccggc 3240 

cagcctcccc aacttgatgt ttgtgttttg tttgggggga tatttttcat aattatttaa 3300 

aagacaggcc gggcgcggtg gctcacgtct gtaatcccag cactttggga ggctgaggcg 3360 

ggcggatcac ctgaggttgg gagttcaaga ccagcctggc caacatgggg aaaccccgtc 3420 

tctactaaaa atacaaaaaa ttagccgggt gtggtggcgc gtgcctataa tcccagctac 3480 

tcgggaggct gaggcaggag aatcgcttga acccgggagg tggaggttgc ggtgagccaa 3540 

gatcgcacca ttgcactcca gcctgggcaa caagagcgaa actctgtctc aaaataaata 3600 

aaaaataaaa gacagaaagc aaggggtgcc taaatctaga cttggggtcc acaccgggca 3660 

gcggggttgc aacccagcac ctggtaggct ccatttcttc ccaagcccga gcagagggtc 3720 

atgcgggccc cacaggagaa gcggccaggg cccgcggggg gcaccacctg tggacagccc 3780 

tcctgtcccc aagctttcag gcaggcactg aaacgcaccg aacttccacg ctctgctggt 3840 

cagtggcggc tgtcccctcc ccagcccagc cgcccagcca catgtgtctg cctgacccgt 3900 

acacaccagg ggttccgggg ttgggagctg aaccatcccc acctcagggt tatatttccc 3960 

tctccccttc cctccccgcc aagagctctg ccaggggcgg gcaaaaaaaa aagtaaaaag 4020 

aaaagaaaaa aaaaaaaaag aaacaaacca cctctacata ttatggaaag aaaatatttt 4080 

tgtcgattct tattctttta taattatgcg tggaagaagt agacacatta aacgattcca 4140 

gttggaaaca tgtcacctg 4159

<210> 32

<211> 4076 

<212> DNA 

<213> Homo sapiens 

<400> 32 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gcccaaccgc ttgcttatcc tccacggctt 2640 

cctggacgaa aacgtgcact ttttccacac aaacttcctc gtctcccaac tgatccgagc 2700 

agggaaacct taccagctcc agatctaccc caacgagaga cacagtattc gctgccccga 2760 

gtcgggcgag cactatgaag tcacgttgct gcactttcta caggaatacc tctgagcctg 2820 

cccaccggga gccgccacat cacagcacaa gtggctgcag cctccgcggg gaaccaggcg 2880 

ggagggactg agtggcccgc gggccccagt gaggcacttt gtcccgccca gcgctggcca 2940 

gccccgagga gccgctgcct tcaccgcccc gacgcctttt atcctttttt aaacgctctt 3000 

gggttttatg tccgctgctt cttggttgcc gagacagaga gatggtggtc tcgggccagc 3060 

ccctcctctc cccgccttct gggaggagga ggtcacacgc tgatgggcac tggagaggcc 3120 

agaagagact cagaggagcg ggctgccttc cgcctggggc tccctgtgac ctctcagtcc 3180 

cctggcccgg ccagccaccg tccccagcac ccaagcatgc aattgcctgt cccccccggc 3240 

cagcctcccc aacttgatgt ttgtgttttg tttgggggga tatttttcat aattatttaa 3300 

aagacaggcc gggcgcggtg gctcacgtct gtaatcccag cactttggga ggctgaggcg 3360 

ggcggatcac ctgaggttgg gagttcaaga ccagcctggc caacatgggg aaaccccgtc 3420 

tctactaaaa atacaaaaaa ttagccgggt gtggtggcgc gtgcctataa tcccagctac 3480 

tcgggaggct gaggcaggag aatcgcttga acccgggagg tggaggttgc ggtgagccaa 3540 

gatcgcacca ttgcactcca gcctgggcaa caagagcgaa actctgtctc aaaataaata 3600 

aaaaataaaa gacagaaagc aaggggtgcc taaatctaga cttggggtcc acaccgggca 3660 

gcggggttgc aacccagcac ctggtaggct ccatttcttc ccaagcccga ctttcaggca 3720 

ggcactgaaa cgcaccgaac ttccacgctc tgctggtcag tggcggctgt cccctcccca 3780 

gcccagccgc ccagccacat gtgtctgcct gacccgtaca caccaggggt tccggggttg 3840 

ggagctgaac catccccacc tcagggttat atttccctct ccccttccct ccccgccaag 3900 

agctctgcca ggggcgggca aaaaaaaaag taaaaagaaa agaaaaaaaa aaaaaagaaa 3960 
caaaccacct ctacatatta tggaaagaaa atatttttgt cgattcttat tcttttataa 4020

ttatgcgtgg aagaagtaga cacattaaac gattccagtt ggaaacatgt cacctg 4076

390




395 




400 


Pro 


Gln


Gln


Trp 


Leu 


Gln Leu


Val Leu Leu 


Pro 


Pro 


Ala Leu Phe Ile










405 




410 






415 


Pro 


Ser 


Thr 


Glu 


Asn 


Glu Glu 


Gln Arg Leu


Ala 


Ser 


Ala Arg Ala Val 








420 






425 






430 


Pro 


Arg 


Asn 


Val 


Gln


Pro Tyr 


Val Val Tyr 


Glu 


Glu 


Val Thr Asn Val 






435 








440 






445 


Trp 


Ile


Asn 


Val 


His 


Asp Ile


Phe Tyr Pro 


Phe 


Pro 


Gln Ser Glu Gly




450 








455 






460 




Glu 


Asp 


Glu 


Leu 


Cys 


Phe Leu 


Arg Ala Asn 


Glu 


Cys 


Lys Thr Gly Phe 


465 










470 




475 




480 


Cys 


His 


Leu 


Tyr 


Lys 


Val Thr 


Ala Val Leu 


Lys 


Ser 


Gln Gly Tyr Asp










485 




490 






495 


Trp 


Ser 


Glu 


Pro 


Phe 


Ser Pro 


Gly Glu Asp 


Glu 


Phe 


Lys Cys Pro Ile








500 






505 






510 


Lys 


Glu 


Glu 


Ile


Ala 


Leu Thr 


Ser Gly Glu 


Trp Glu 


Val Leu Ala Arg 






515 






Thr Lys 


520 






525 


His 


Gly 


Ser 


Lys 


Gly 


Asp Thr Pro 


Leu 


Glu 


His His Leu Tyr 




530 








535 






540 




Val 


Val 


Ser 


Tyr 


Glu 


Ala Ala 


Gly Glu Ile


Val Arg 


Leu Thr Thr Pro 


545 










550 




555 




560 







WO 02/31134 PCT/US01/31874



Gly 


Phe Ser 



His 


Ser 


Cys 


Ser Met 


Ser


Gln


Asn Phe 


Asp


Met 


Phe


Val






565 








570 








575 




Ser 


His Tyr 


Ser 


Ser 


Val 


Ser Thr


Pro 


Pro 


Cys Val


His


Val


Tyr 


Lys 




580 








585 








590 






Leu 


Ser Gly 


Pro 


Asp 


Asp 


Asp Pro 


Leu 


His 


Lys Gln


Pro 



Arg 


Phe


Trp 




595 








600 








605 








Ala 


Ser Met 


Met 


Glu 


Ala 


Ala Ser 


Cys 


Pro 


Pro Asp 


Tyr 


Val


Pro 


Pro 




610 








615 






620 










Glu 


Ile Phe


His


Phe


His 


Thr Arg 


Ser 


Asp 


Val Arg


Leu 


Tyr 


Gly


Met


625 








630 








635 








640 


Ile


Tyr Lys 


Pro 


His


Ala 


Leu Gln


Pro 


Gly 


Lys Lys 


His


Pro 


Thr


Val








645 








650 








655 




Leu 


Phe Val 


Tyr 


Gly 


Gly 


Pro Gln


Val 


Gln


Leu Val 


Asn 


Asn 


Ser 


Phe 






660 








665 








670 






Lys 


Gly Ile


Lys 


Tyr 


Leu 


Arg Leu 


Asn 


Thr 


Leu Ala 


Ser 


Leu 


Gly 


Tyr 




675 








680 








685 








Ala 


Val Val 


Val 


Ile


Asp 


Gly Arg 


Gly 


Ser 


Cys Gln


Arg 


Gly 


Leu 


Arg 




690 








695 






700 










Phe 


Glu Gly 


Ala 


Leu 


Lys 


Asn Gln


Met 


Gly 


Gln Val


Glu 


Ile


Glu 


Asp 


705 






710 








715 








720 


Gln


Val Glu 


Gly 


Leu 


Gln


Phe Val 



Ala 


Glu 


Lys Tyr 


Gly 


Phe


Ile


Asp 








725 








730 








735 




Leu 


Ser Arg 


Val 


Ala 


Ile


His Gly 


Trp 


Ser 


Tyr Gly 


Gly 


Phe


Leu 


Ser






740 








745 








750 






Leu 


Met Gly 


Leu 


Ile


His 


Lys Pro 


Gln


Val 


Phe Lys 


Val 


Ala 


Ile


Ala 




755 








760 








765 








Gly 


Ala Pro 


Val 


Thr 


Val


Trp Met 


Ala 


Tyr 


Asp Thr 


Gly 


Tyr 


Thr 


Glu 




770 








775 






780 










Arg 


Tyr Met 


Asp 


Val 


Pro 


Glu Asn 


Asn 


Gln


His Gly 


Tyr 


Glu 


Ala 


Gly 


785 








790 








795 








800 


Ser 


Val Ala


Leu 


His 


Val 


Glu Lys 


Leu 


Pro 


Asn Glu 


Pro 


Asn 


Arg 


Leu 








805 








810 








815 




Leu 


Ile Leu


His 


Gly 


Phe 


Leu Asp 


Glu 


Asn 


Val His 


Phe 


Phe 


His 


Thr 






820








825 








830 






Asn 


Phe Leu 


Val


Ser 


Gln


Leu Ile


Arg 


Ala 


Gly Lys 


Pro 


Tyr 


Gln


Leu 




835 








840 








845 








Gln


Ile Tyr


Pro 


Asn 


Glu 


Arg His 


Ser 


Ile


Arg Cys 


Pro 


Glu 


Ser 


Gly 




850 








855 






860 










Glu 


His Tyr 


Glu 


Val 


Thr 


Leu Leu 


His 


Phe 


Leu Gln


Glu 


Tyr 


Leu 




865 








870 








875 











<210> 34 

<211> 4263

<212> DNA 

<213> Homo sapiens 

<400> 34 



caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 

gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 

gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggtggccatc 2580 

gcgggtgccc cggtcaccgt ctggatggcc tacgacacag ggtacactga gcgctacatg 2640 

gacgtccctg agaacaacca gcacggctat gaggcgggtt ccgtggccct gcacgtggag 2700 

aagctgccca atgagcccaa ccgcttgctt atcctccacg gcttcctgga cgaaaacgtg 2760 

cactttttcc acacaaactt cctcgtctcc caactgatcc gagcagggaa accttaccag 2820 

ctccagatct accccaacga gagacacagt attcgctgcc ccgagtcggg cgagcactat 2880 

gaagtcacgt tgctgcactt tctacaggaa tacctctgag cctgcccacc gggagccgcc 2940 

acatcacagc acaagtggct gcagcctccg cggggaacca ggcgggaggg actgagtggc 3000 

ccgcgggccc cagtgaggca ctttgtcccg cccagcgctg gccagccccg aggagccgct 3060 

gccttcaccg ccccgacgcc ttttatcctt ttttaaacgc tcttgggttt tatgtccgct 3120 

gcttcttggt tgccgagaca gagagatggt ggtctcgggc cagcccctcc tctccccgcc 3180 

ttctgggagg aggaggtcac acgctgatgg gcactggaga ggccagaaga gactcagagg 3240 

agcgggctgc cttccgcctg gggctccctg tgacctctca gtcccctggc ccggccagcc 3300 

accgtcccca gcacccaagc atgcaattgc ctgtcccccc cggccagcct ccccaacttg 3360 

atgtttgtgt tttgtttggg gggatatttt tcataattat ttaaaagaca ggccgggcgc 3420 

ggtggctcac gtctgtaatc ccagcacttt gggaggctga ggcgggcgga tcacctgagg 3480 

ttgggagttc aagaccagcc tggccaacat ggggaaaccc cgtctctact aaaaatacaa 3540 

aaaattagcc gggtgtggtg gcgcgtgcct ataatcccag ctactcggga ggctgaggca 3600 

ggagaatcgc ttgaacccgg gaggtggagg ttgcggtgag ccaagatcgc accattgcac 3660 

tccagcctgg gcaacaagag cgaaactctg tctcaaaata aataaaaaat aaaagacaga 3720 

aagcaagggg tgcctaaatc tagacttggg gtccacaccg ggcagcgggg ttgcaaccca 3780 

gcacctggta ggctccattt cttcccaagc ccgagcagag ggtcatgcgg gccccacagg 3840 

agaagcggcc agggcccgcg gggggcacca cctgtggaca gccctcctgt ccccaagctt 3900 

tcaggcaggc actgaaacgc accgaacttc cacgctctgc tggtcagtgg cggctgtccc 3960 

ctccccagcc cagccgccca gccacatgtg tctgcctgac ccgtacacac caggggttcc 4020 

ggggttggga gctgaaccat ccccacctca gggttatatt tccctctccc cttccctccc 4080 

cgccaagagc tctgccaggg gcgggcaaaa aaaaaagtaa aaagaaaaga aaaaaaaaaa 4140 

aaagaaacaa accacctcta catattatgg aaagaaaata tttttgtcga ttcttattct 4200 

tttataatta tgcgtggaag aagtagacac attaaacgat tccagttgga aacatgtcac 4260 

ctg 4263

<210> 35

<211> 879 

<212> PRT 

<213> Homo sapiens 

<400> 35 



Met 


Arg 


Lys 


Val 


Lys Lys Leu Arg 


1








5 


Trp 


Arg 


Ser 


Phe 


Ser Leu Asn Ser 








20 




Thr 


Gly 


Thr 


Pro 


Thr Ala Asp Arg 






35 




40 


Pro 


Ala 


Ala 


Arg 


Phe Gln Val Gln




50 






55 


Ser 


Ile


Ile


His 


Gly Ser Arg Lys 


65 








70 


Ala 


Pro 


His 


Asp 


Phe Gln Phe Val










85 


His 


Ser 


His 


Arg 


Leu Tyr Tyr Leu 








100 




Asn 


Ser 


Leu 


Leu 


Tyr Ser Glu Ile






115 




120 


Leu 


Leu 


Leu 


Leu 


Ser Trp Lys Gln




130 






135 


Pro 


His 


His 


Gly 


Val Tyr Ser Arg 


145 








150 


Lys 


Arg 


Leu 


Gly 


Val Phe Gly Ile










165 


Ser 


Gly 


Leu 


Phe 


Leu Phe Gln Ala








180 




Asp 


Gly 


Gly 


Lys 


Asn Gly Phe Met 






195 




200 


Ile


Lys 


Thr 


Gln


Cys Ser Gly Pro 




210 






215 


Ala 


Asp 


Pro 


Ala 


Phe Phe Ser Phe 


225 








230 


Ala 


Asn 


Ile


Glu 


Thr Gly Glu Glu 










245 


Gly 


Leu 


Ser 


Asn 


Val Leu Asp Asp 








260 




Phe 


Val


Ile


Gln


Glu Glu Phe Asp 






275 




280 


Pro 


Thr 


Ala 


Ser 


Trp Glu Gly Ser 




290 






295 


Leu 


Tyr 


Glu 


Glu 


Val Asp Glu Ser 


305 








310 


Ser 


Pro 


Ala 


Leu 


Glu Glu Arg Lys 










325 


Thr 


Gly 


Ser 


Lys 


Asn Pro Lys Ile








340 




Thr 


Asp 


Ser 


Gln


Gly Lys Ile Val






355 




360 


Gln


Pro 


Phe 


Ser 


Ser Leu Phe Pro 




370 






375 


Gly 


Trp 


Thr 


Arg 


Asp Gly Lys Tyr 


385 








390 


Pro 


Gln


Gln


Trp 


Leu Gln Leu Val



405 



Leu Asp Lys Glu Asn Thr 


Gly 


Ser 


10 


15 




Glu Gly Ala Glu Arg Met 


Ala 


Thr 


25 30






Gly Asp Ala Ala Ala Thr 


Asp 


Asp 


45 






Lys His Ser Trp Asp Gly 


Leu 


Arg 


60 






Tyr Ser Gly Leu Ile Val


Asn 


Lys 


75 




80 


Gln Lys Thr Asp Glu Ser


Gly 


Pro 


90 


95 




Gly Met Pro Tyr Gly Ser 


Arg 


Glu 


105 110 






Pro Lys Lys Val Arg Lys 


Glu 


Ala 


125 






Met Leu Asp His Phe Gln


Ala 


Thr 


140 






Glu Glu Glu Leu Leu Arg 


Glu 


Arg 


155 




160 


Thr Ser Tyr Asp Phe His 


Ser 


Glu 


170 


175 




Ser Asn Ser Leu Phe His 


Cys 


Arg 


185 190 






Val Ser Pro Met Lys Pro 


Leu 


Glu 


205 






Arg Met Asp Pro Lys Ile


Cys 


Pro 


220 






Ile Asn Asn Ser Asp Leu


Trp 


Val 


235 




240 


Arg Arg Leu Thr Phe Cys 


His 


Gln


250 


255 




Pro Lys Ser Ala Gly Val 


Ala 


Thr 


265 270 






Arg Phe Thr Gly Tyr Trp 


Trp 


Cys 


285 






Glu Gly Leu Lys Thr Leu 


Arg 


Ile


300 






Glu Val Glu Val Ile His


Val 


Pro 


315 




320 


Thr Asp Ser Tyr Arg Tyr 


Pro 


Arg 


330 


335 




Ala Leu Lys Leu Ala Glu 


Phe 


Gln


345 350 






Ser Thr Gln Glu Lys Glu


Leu 


Val 


365 






Lys Val Glu Tyr Ile Ala


Arg 


Ala 


380 






Ala Trp Ala Met Phe Leu 


Asp 


Arg 


395 




400 


Leu Leu Pro Pro Ala Leu 


Phe 


Ile


410 


415

Pro


Ser 


Thr 


Glu 








420 


Pro 


Arg 


Asn 


Val 






435 




Trp


Ile


Asn 


Val 




450 






Glu 


Asp 


Glu 


Leu 


465 








Cys 


His 


Leu 


Tyr 


Trp 


Ser 


Glu


Pro 








500 


Lys 


Glu 


Glu


Ile






515 




His 


Gly 


Ser 


Lys 




530 






Val 


Val 


Ser 


Tyr 


545 








Gly 


Phe 


Ser 


His 


Ser 


His 


Tyr 


Ser 








580 


Leu 


Ser 


Gly 


Pro 






595 




Ala 


Ser 


Met 


Met 




610 






Glu 


lie 


Phe 


His 


625 








lie 


Tyr 


Lys 


Pro 


Leu 


Phe 


Val 


Tyr 








660 


LVS 


Gly 


He 


Lvs 






675 




Ala 


Val 


Val 


val 




690 






Phe 


Glu 


Gly 


Ala 


705 








Gin 


val 


Glu 


Gly 


Leu 


Ser 


Arg 


Val 








740 


Leu 


Met 


Gly 


Leu 






755 




Gly 


Ala 


Pro 


Val 




770 






Arg 


Tyr 


Met 


Asp 


785 








Ser 


Val 


Ala 


Leu 


Leu 


lie 


Leu 


His 








820 


Asn 


Phe 


Leu 


Val 






835 




Gin 


He 


Tyr 


Pro 




850 






Glu 


His 


Tyr 


Glu 



865 



Asn 


Glu 


Glu 


Gin 


Gin 


Pro 


Tyr 


Val 








440 


His 


Asp 


He 


Phe 






455 




Cys 


Phe 


Leu 


Arg 




470 






Lys 


val 


Thr 


Ala 


485 








Phe 


Ser 


Pro 


Gly 


Ala 


Leu 


Thr 


Ser 








520 


Gly 
* 


Thr 


Lys 


Asp 






535 




GlU 


Ala 


Ala 


Gly 




550 






Ser 


Cys 


Ser 


Met 


565 








Ser 


Val 


Ser 


Thr 


Asp 


Asp 


Asp 


Pro 








600 


Glu 


Ala 


Ala 


Ser 






615 




Phe 


His 


Thr 


Arg 




630 






His 


Ala 


Leu 


Gin 


645 








Gly 


Gly 


Pro 


Gin 


Tyr 


Leu 


Arg 


Leu 








680 


He 


AST) 


Glv 


Arg 






695 




Leu 


Lys 


Asn 


Gin 




710 






Leu 


Gin 


Phe 


Val 


725 








Ala 


He 


His 


Gly 


He 


His 


Lys 


Pro 








760 


Thr 


val 


Trp 


Met 






775 




val 


Pro 


Glu 


Asn 




790 






His 


Val 


Glu 


Lys 


805 








Gly 


Phe 


Leu 


Asp 


Ser 


Gin 


Leu 


He 








840 


Asn 


Glu 


Arg 


His 






855 




val 


Thr 


Leu 


Leu 




870 







Arg 


Leu Ala 


Ser 


425 






Val 


Tyr Glu 


Glu 


Tyr 


Pro Phe 


Pro 






460 


Ala 


Asn Glu 


Cys 




475 




Val 


Leu Lys 


Ser 




490 




Glu 


Asp Glu 


Phe 


505 






Gly 


Glu Trp 


Glu 


Thr 


Pro Leu 


Glu 






540 


Glu 


He Val 


Arq 




555 




Ser 


Gin Asn 


Phe 




570 




Pro 


Pro Cys 


Val 


585 






Leu 


His Lys 


Gin 


Cys 


Pro Pro 


Asp 






620 


Ser 


Asp Val 


Arq 




635 




Pro 


Gly Lys 


Lys 




650 




Val 


Gin Leu 


Val 


665 






Asn 


Thr Leu 


Ala 


Gly 


Ser Cys 


Gin 






700 


Met 


Gly Gin 


Val 




715 




Ala 


Glu Lys 


Tyr 




730 




Trp 


Ser Tyr 


Gly 


745 






Gin 


Val Phe 


Lys 


Ala 


Tyr Asp 


Thr 






780 


Asn 


Gin His 


Gly 




795 




Leu 


Pro Asn 


Glu 




810 




Glu 


Asn Val 


His 


825 






Arg 


Ala Gly 


Lys 


Ser 


He Arg 


Cys 






860 


His 


Phe Leu 


Gin 




875 





Ala 


Arg 


Ala 


Val 




430 






Val 


Thr 


Asn 


Val 


445 








Gin 


Ser 


Glu 


Gly 


Lys 


Thr 


Gly 


Phe 








480 


Gin 


Gly 


Tyr 


Asp 






495 




Lys 


Cys 


Pro 


He 




510 






Val 


Leu 


Ala 


Arg 


525 








His 


His 


Leu 


Tyr 


Leu 


Thr 


Thr 


Pro 








560 


Asp 


Met 


Phe 


Val 






575 




His 


val 


Tyr 


Lys 




590 






Pro 


Arg 


Phe 


Trp 


605 








Tyr 


val 


Pro 


Pro 


Leu 


Tyr 


Gly 


Met 








640 


His 


Pro 


Thr 


Val 






655 




Asn 


Asn 


Ser 


Phe 




670 






Ser 


Leu 


Gly 


Tyr 


685 








Arg 


Gly 


Leu 


Arg 


Glu 


He 


Glu 


Asp 








720 


Gly 


Phe 


He 


Asp 






735 




Gly 


Phe 


Leu 


Ser 




750 






Val 


Ala 


He 


Ala 


765 








Gly 


Tyr 


Thr 


Glu 


Tyr 


Glu 


Ala 


Gly 








800 


Pro 


Asn 


Arg 


Leu 






815 




Phe 


Phe 


His 


Thr 




830 






Pro 


Tyr 


Gin 


Leu 


845 








Pro 


Glu 


Ser 


Gly 


Glu 


Tyr 


Leu 
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<210> 36 

<211> 4180 

<212> DNA 

<213> Homo sapiens 

<400> 36 



caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 

gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 

gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggtggccatc 2580 

gcgggtgccc cggtcaccgt ctggatggcc tacgacacag ggtacactga gcgctacatg 2640 

gacgtccctg agaacaacca gcacggctat gaggcgggtt ccgtggccct gcacgtggag 2700 

aagctgccca atgagcccaa ccgcttgctt atcctccacg gcttcctgga cgaaaacgtg 2760 

cactttttcc acacaaactt cctcgtctcc caactgatcc gagcagggaa accttaccag 2820 

ctccagatct accccaacga gagacacagt attcgctgcc ccgagtcggg cgagcactat 2880 

gaagtcacgt tgctgcactt tctacaggaa tacctctgag cctgcccacc gggagccgcc 2940 

acatcacagc acaagtggct gcagcctccg cggggaacca ggcgggaggg actgagtggc 3000 

ccgcgggccc cagtgaggca ctttgtcccg cccagcgctg gccagccccg aggagccgct 3060 

gccttcaccg ccccgacgcc ttttatcctt ttttaaacgc tcttgggttt tatgtccgct 3120

gcttcttggt tgccgagaca gagagatggt ggtctcgggc cagcccctcc tctccccgcc 3180

ttctgggagg aggaggtcac acgctgatgg gcactggaga ggccagaaga gactcagagg 3240

agcgggctgc cttccgcctg gggctccctg tgacctctca gtcccctggc ccggccagcc 3300

accgtcccca gcacccaagc atgcaattgc ctgtcccccc cggccagcct ccccaacttg 3360

atgtttgtgt tttgtttggg gggatatttt tcataattat ttaaaagaca ggccgggcgc 3420

ggtggctcac gtctgtaatc ccagcacttt gggaggctga ggcgggcgga tcacctgagg 3480

ttgggagttc aagaccagcc tggccaacat ggggaaaccc cgtctctact aaaaatacaa 3540

aaaattagcc gggtgtggtg gcgcgtgcct ataatcccag ctactcggga ggctgaggca 3600

ggagaatcgc ttgaacccgg gaggtggagg ttgcggtgag ccaagatcgc accattgcac 3660

tccagcctgg gcaacaagag cgaaactctg tctcaaaata aataaaaaat aaaagacaga 3720

aagcaagggg tgcctaaatc tagacttggg gtccacaccg ggcagcgggg ttgcaaccca 3780

gcacctggta ggctccattt cttcccaagc ccgactttca ggcaggcact gaaacgcacc 3840

gaacttccac gctctgctgg tcagtggcgg ctgtcccctc cccagcccag ccgcccagcc 3900

acatgtgtct gcctgacccg tacacaccag gggttccggg gttgggagct gaaccatccc 3960

cacctcaggg ttatatttcc ctctcccctt ccctccccgc caagagctct gccaggggcg 4020

ggcaaaaaaa aaagtaaaaa gaaaagaaaa aaaaaaaaaa gaaacaaacc acctctacat 4080

attatggaaa gaaaatattt ttgtcgattc ttattctttt ataattatgc gtggaagaag 4140

tagacacatt aaacgattcc agttggaaac atgtcacctg 4180

<210> 37

<211> 819

<212> PRT

<213> Homo sapiens

<400> 37

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1 
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Ser 


Phe 
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Ser 


Leu 


Asn 


Ser 


Glu 
25 


Gly 


Ala 


Glu 


Arg 


Met 
30 


Ala 


Thr 


Thr 


Gly 


Thr 
35 


Pro 


Thr 


Ala 


Asp 


Arg 
40 


Gly 


Asp 


Ala 


Ala 


Ala 
45 


Thr 
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Asp 


Pro 


Ala 
50 
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Arg 
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Val 
55 
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Lys 


His 


Ser 


Trp 
60 
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Gly 
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Arg 


Ser 


He 


He 


His 


Gly 


Ser 


Arg 


Lys 
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Ser 


Gly 


Leu 


He 


Val 


Asn 


Lys 


65 










70 










75 










80 


Ala 


Pro 


His 


Asp 


Phe 
85 


Gin 


Phe 


Val 


Gin 


Lys 
90 


Thr 
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Glu 


Ser 


Gly 
95 


Pro 


His 


Ser 


His 


Arg 
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Leu 


Tyr 


Tyr 


Leu 


Gly 
105 
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Pro 
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Gly 


Ser 
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Arg 


Glu 


Asn 


Ser 


Leu 
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Glu 
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Lys 
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Leu 


Leu 
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Leu 
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Trp 


Lys 
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Leu 
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Thr 
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His 


His 
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Arg 


Glu 


Glu 


GlU 


Leu 


Leu 


Arg 


Glu 


Arg 
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Lys 


Arg 


Leu 


Gly 


Val 
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Phe 


Gly 


He 


Thr 


Ser 
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Tyr 


Asp 


Phe 


His 


Ser 
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Glu 


Ser 


Gly 


Leu 


Phe 


Leu 


Phe 


Gin 


Ala 


Ser 


Asn 


Ser 


Leu 


Phe 


His 


Cys 


Arg 






180 










185 










190 






Asp 


Gly 


Gly 
195 


Lys 


Asn 


Gly 


Phe 


Met 
200 


Val 


Ser 


Pro 


Met 


Lys 
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Pro 


Leu 


Glu 


He 


Lys 
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Thr 


Gin 


Cys 


Ser 


Gly 
215 


Pro 


Arg 


Met 


Asp 


Pro 
220 


Lys 


lie 


Cys 


Pro 


Ala 


Asp 


Pro 


Ala 


Phe 


Phe 


Ser 


Phe 


He 


Asn 


Asn 


Ser 
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Leu 


Trp 


Val 


225 








230 










235 










240 


Ala 


Asn 


He 


Glu 


Thr 
245 


Gly 


Glu 


Glu 


Arg 


Arg 
250 


Leu 


Thr 


Phe 


Cys 


His 
255 


Gin 


Gly 


Leu 


Ser 


Asn 
260 


Val 


Leu 


Asp 


Asp 


Pro 
265 


Lys 


Ser 


Ala 


Gly 


Val 
270 


Ala 


Thr 
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Phe 


Val 


He 


Gin 


Glu 


Glu 


Phe 


Asp 






275 










280 


Pro 


Thr 


Ala 


Ser 


Trp 


Glu 


Gly 


Ser 
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Leu 
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Glu 


Glu 


Val 
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Glu 


Ser 
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Leu 
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Glu 


Arg 


Lys 
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Lys 


He 
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Thr 
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Ser 
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Gly 
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He 


Val 
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Gin 


Pro 


Phe 


Ser 


Ser 


Leu 


Phe 


Pro 




370 
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Gly 


Trp 


Thr 


Arg 


Asp


Gly 


Lys 


Tyr 


385 
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Pro 


Gin 


Gin 


Trp 


Leu 


Gin 


Leu 


Val 










405 








Pro 


Ser 


Thr 


Glu 


Asn 


Glu 


Glu 


Gin 








420 










Pro 


Arg 


Asn 


Val 


Gin 


Pro 


Tyr 


Val 






435 










440 


Trp 


lie 


Asn 


Val 


His 


Asp 


He 


Phe 




450 










455 




Glu 


Asp 


Glu 


Leu 


Cys 


Phe 


Leu 


Arg 


465 










470 






Cys 


His 


Leu 


Tyr 


Lys 


Val 


Thr 


Ala 










485 








Trp 


Ser 


Glu 


Pro 


Phe 


Ser 


Pro 


Gly 








500 










Lys 


Glu 


Glu 


He 


Ala 


Leu 


Thr 


Ser 






515 
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His 


Gly 


Ser 


Lys 


Gly 


Thr 


Lys 
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530 
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Val 


Val 


Ser 


Tyr 


Glu 
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Ala 


Gly 
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550 






Gly 


Phe 


Ser 


His 


Ser 


Cys 


Ser 


Met 










565 








Ser 


His 


Tyr 


Ser 


Ser 


Val 


Ser 


Thr 








580 










Leu 


Ser 


Gly 


Pro 


Asp 


Asp 


Asp 


Pro 






595 










600 


Ala 


Ser 


Met 


Met 


Glu 


Ala 


Ala 


Ser 




610 










615 




Glu 


lie 


Phe 


His 


Phe 


His 


Thr 


Arg 


625 










630 






lie 


Tyr 


Lys 


Pro 


His 
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Leu 


Gin 










645 








Leu 


Phe 


Val 


Tyr 


Gly 


Gly 


Pro 


Gin 
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Lys 


Gly 


He 


Lys 


Tyr 


Leu 


Arg 


Leu 






675 










680 


Ala 


Val 


Val 


val 


He 


Asp 


Gly 


Arg 




690 










695 




Phe 


Glu 


Gly 


Ala 


Leu 


Lys 


Asn 


Gin 


705 










710 






Gin 


Val 


Glu 


Gly 


Leu 


Gin 


Phe 


Val 



725 



Arg 


Phe 


Thr 


Gly 


Tyr Trp Trp 


Cys 










285 




Glu 


Gly 


Leu 


Lys 


Thr Leu Arg 


He 








300 






Glu 


Val 


Glu 


Val 


He His Val 


Pro 






315 






320 


Thr 


Asp 


Ser 


Tyr 


Arg Tyr Pro 


Arg 




330 






335 




Ala 


Leu 


Lys 


Leu 


Ala Glu Phe 


Gin 


345 








350 




Ser 


Thr 


Gin 


Glu 


Lys Glu Leu 


val 










365 




Lys 


Val 


Glu 


Tyr 


He Ala Arg 


Ala 








380 






Ala 


Trp 


Ala 


Met 


Phe Leu Asp 


Arg 






395 






400 


Leu 


Leu 


Pro 


Pro 


Ala Leu Phe 


He 




410 






415 




Arg 


Leu 


Ala 


Ser 


Ala Arg Ala 


Val 


425 








430 




Val 


Tyr 


Glu 


GlU 


Val Thr Asn 


Val 










445 




Tyr 


Pro 


Phe 


Pro 


Gin Ser Glu 


Gly 








460 






Ala 


Asn 


Glu 


Cys 


Lys Thr Gly 


Phe 






475 






480 


val 


Leu 


Lys 


Ser 


Gin Gly Tyr 


Asp 




490 






495 




Glu 


Asp 


GlU 


Phe 


Lys Cys Pro 


He 


505 








510 




Gly 


GlU 


Trp 


Glu 


Val Leu Ala 


Arg 










525 




Thr 


Pro 


Leu 


Glu 


His His Leu 


Tyr 








540 






Glu 


He 


Val 


Arg 


Leu Thr Thr 


Pro 






555 






560 


Ser 


Gin 


Asn 


Phe 


Asp Met Phe 


Val 




570 






575 




Pro 


Pro 


Cys 


Val 


His Val Tyr 


Lys 


585 
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Leu 


His 


Lys 


Gin 


Pro Arg Phe 


Trp 
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Cys 


Pro 


Pro 


Asp 


Tyr Val Pro 


Pro 
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Ser 


Asp 


Val 


Arg 


Leu Tyr Gly 


Met 
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640 


Pro 


Gly 


Lys 


Lys 


His Pro Thr 


Val 




650 






655 




Val 


Gin 


Leu 


Val 


Asn Asn Ser 


Phe 


665 
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Asn 


Thr 


Leu 


Ala 


Ser Leu Gly 


Tyr 










685 




Gly 


Ser 


Cys 


Gin 


Arg Gly Leu 


Arg 
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Met 


Gly 


Gin 


Val 


Glu He Glu 


Asp 
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Ala 


Glu 


Lys 


Tyr 


Gly Phe He 


Asp 




730 






735 
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Leu 


Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser
740 745 750

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Ala Gln Pro Leu
755 760 765

Ala Tyr Pro Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His
770 775 780

Lys Leu Pro Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala
785 790 795 800

Pro Asp Leu Pro Gln Arg Glu Thr Gln Tyr Ser Leu Pro Arg Val Gly
805 810 815

Arg Ala Leu














<210> 38 















<211> 4120 

<212> DNA 

<213> Homo sapiens 

<400> 38 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 
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gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 

gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggcccaaccg 2580 

cttgcttatc ctccacggct tcctggacga aaacgtgcac tttttccaca caaacttcct 2640 

cgtctcccaa ctgatccgag cagggaaacc ttaccagctc cagatctacc ccaacgagag 2700 

acacagtatt cgctgccccg agtcgggcga gcactatgaa gtcacgttgc tgcactttct 2760 

acaggaatac ctctgagcct gcccaccggg agccgccaca tcacagcaca agtggctgca 2820 

gcctccgcgg ggaaccaggc gggagggact gagtggcccg cgggccccag tgaggcactt 2880 

tgtcccgccc agcgctggcc agccccgagg agccgctgcc ttcaccgccc cgacgccttt 2940 

tatccttttt taaacgctct tgggttttat gtccgctgct tcttggttgc cgagacagag 3000 

agatggtggt ctcgggccag cccctcctct ccccgccttc tgggaggagg aggtcacacg 3060 

ctgatgggca ctggagaggc cagaagagac tcagaggagc gggctgcctt ccgcctgggg 3120 

ctccctgtga cctctcagtc ccctggcccg gccagccacc gtccccagca cccaagcatg 3180 

caattgcctg tcccccccgg ccagcctccc caacttgatg tttgtgtttt gtttgggggg 3240 

atatttttca taattattta aaagacaggc cgggcgcggt ggctcacgtc tgtaatccca 3300 

gcactttggg aggctgaggc gggcggatca cctgaggttg ggagttcaag accagcctgg 3360 

ccaacatggg gaaaccccgt ctctactaaa aatacaaaaa attagccggg tgtggtggcg 3420 

cgtgcctata atcccagcta ctcgggaggc tgaggcagga gaatcgcttg aacccgggag 3480 

gtggaggttg cggtgagcca agatcgcacc attgcactcc agcctgggca acaagagcga 3540 

aactctgtct caaaataaat aaaaaataaa agacagaaag caaggggtgc ctaaatctag 3600 

acttggggtc cacaccgggc agcggggttg caacccagca cctggtaggc tccatttctt 3660 

cccaagcccg agcagagggt catgcgggcc ccacaggaga agcggccagg gcccgcgggg 3720 

ggcaccacct gtggacagcc ctcctgtccc caagctttca ggcaggcact gaaacgcacc 3780 

gaacttccac gctctgctgg tcagtggcgg ctgtcccctc cccagcccag ccgcccagcc 3840 

acatgtgtct gcctgacccg tacacaccag gggttccggg gttgggagct gaaccatccc 3900 

cacctcaggg ttatatttcc ctctcccctt ccctccccgc caagagctct gccaggggcg 3960 

ggcaaaaaaa aaagtaaaaa gaaaagaaaa aaaaaaaaaa gaaacaaacc acctctacat 4020 

attatggaaa gaaaatattt ttgtcgattc ttattctttt ataattatgc gtggaagaag 4080 

tagacacatt aaacgattcc agttggaaac atgtcacctg 4120 

<210> 39 

<211> 819 

<212> PRT 

<213> Homo sapiens 

<400> 39 
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Trp 
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Phe 
20 
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Leu 


Asn Ser Glu Gly Ala Glu Arg Met Ala Thr
25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp
35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg
50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro
85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu
100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala
115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr
130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu
165 170 175

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg
180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu
195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro
210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln
245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr
260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys
275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile
290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg
325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln
340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val
355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala
370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile
405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val
420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val
435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly
450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp
485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile
500 505 510

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg
515 520 525

His Gly Ser Lys Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr
530 535 540

Val Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro
545 550 555 560

Gly Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val
565 570 575

Ser His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys
580 585 590

Leu Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp
595 600 605

Ala Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro
610 615 620

Glu Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met
625 630 635 640

Ile Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val
645 650 655

Leu Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe
660 665 670

Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr
675 680 685

Ala Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg
690 695 700

Phe Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp
705 710 715 720

Gln Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp
725 730 735

Leu Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser
740 745 750

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Ala Gln Pro Leu
755 760 765

Ala Tyr Pro Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His
770 775 780

Lys Leu Pro Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala
785 790 795 800

Pro Asp Leu Pro Gln Arg Glu Thr Gln Tyr Ser Leu Pro Arg Val Gly
805 810 815

Arg Ala Leu



<210> 40 

<211> 4037 

<212> DNA 

<213> Homo sapiens 

<400> 40 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 
aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920 

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 

gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 

gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggcccaaccg 2580 

cttgcttatc ctccacggct tcctggacga aaacgtgcac tttttccaca caaacttcct 2640 

cgtctcccaa ctgatccgag cagggaaacc ttaccagctc cagatctacc ccaacgagag 2700 

acacagtatt cgctgccccg agtcgggcga gcactatgaa gtcacgttgc tgcactttct 2760 

acaggaatac ctctgagcct gcccaccggg agccgccaca tcacagcaca agtggctgca 2820 

gcctccgcgg ggaaccaggc gggagggact gagtggcccg cgggccccag tgaggcactt 2880 

tgtcccgccc agcgctggcc agccccgagg agccgctgcc ttcaccgccc cgacgccttt 2940 

tatccttttt taaacgctct tgggttttat gtccgctgct tcttggttgc cgagacagag 3000 

agatggtggt ctcgggccag cccctcctct ccccgccttc tgggaggagg aggtcacacg 3060 

ctgatgggca ctggagaggc cagaagagac tcagaggagc gggctgcctt ccgcctgggg 3120 

ctccctgtga cctctcagtc ccctggcccg gccagccacc gtccccagca cccaagcatg 3180 

caattgcctg tcccccccgg ccagcctccc caacttgatg tttgtgtttt gtttgggggg 3240 

atatttttca taattattta aaagacaggc cgggcgcggt ggctcacgtc tgtaatccca 3300 

gcactttggg aggctgaggc gggcggatca cctgaggttg ggagttcaag accagcctgg 3360 

ccaacatggg gaaaccccgt ctctactaaa aatacaaaaa attagccggg tgtggtggcg 3420 

cgtgcctata atcccagcta ctcgggaggc tgaggcagga gaatcgcttg aacccgggag 3480 

gtggaggttg cggtgagcca agatcgcacc attgcactcc agcctgggca acaagagcga 3540 

aactctgtct caaaataaat aaaaaataaa agacagaaag caaggggtgc ctaaatctag 3600 

acttggggtc cacaccgggc agcggggttg caacccagca cctggtaggc tccatttctt 3660 

cccaagcccg actttcaggc aggcactgaa acgcaccgaa cttccacgct ctgctggtca 3720 

gtggcggctg tcccctcccc agcccagccg cccagccaca tgtgtctgcc tgacccgtac 3780 

acaccagggg ttccggggtt gggagctgaa ccatccccac ctcagggtta tatttccctc 3840 

tccccttccc tccccgccaa gagctctgcc aggggcgggc aaaaaaaaaa gtaaaaagaa 3900 

aagaaaaaaa aaaaaaagaa acaaaccacc tctacatatt atggaaagaa aatatttttg 3960 

tcgattctta ttcttttata attatgcgtg gaagaagtag acacattaaa cgattccagt 4020 

tggaaacatg tcacctg 4037 

<210> 41 

<211> 706 

<212> PRT 

<213> Homo sapiens 

<400> 41 

Asp Thr Asp Val Val Tyr Lys Ser Glu Asn Gly His Val Ile Lys Leu 
1 5 10 15 

Asn Ile Glu Thr Asn Ala Thr Thr Leu Leu Leu Glu Asn Thr Thr Phe 
20 25 30 

Val Thr Phe Lys Ala Ser Arg His Ser Val Ser Pro Asp Leu Lys Tyr 
35 40 45 

Val Leu Leu Ala Tyr Asp Val Lys Gln Ile Phe His Tyr Ser Tyr Thr 
50 55 60 

Ala Ser Tyr Val Ile Tyr Asn Ile His Thr Arg Glu Val Trp Glu Leu 
65 70 75 80 

Asn Pro Pro Glu Val Glu Asp Ser Val Leu Gln Tyr Ala Ala Trp Gly 
85 90 95 

Val Gln Gly Gln Gln Leu Ile Tyr Ile Phe Glu Asn Asn Ile Tyr Tyr 
100 105 110 

Gln Pro Asp Ile Lys Ser Ser Ser Leu Arg Leu Thr Ser Ser Gly Lys 
115 120 125 

Glu Glu Ile Ile Phe Asn Gly Ile Ala Asp Trp Leu Tyr Glu Glu Glu 
130 135 140 

Leu Leu His Ser His Ile Ala His Trp Trp Ser Pro Asp Gly Glu Arg 
145 150 155 160 

Leu Ala Phe Leu Met Ile Asn Asp Ser Leu Val Pro Thr Met Val Ile 
165 170 175 

Pro Arg Phe Thr Gly Ala Leu Tyr Pro Lys Gly Lys Gln Tyr Pro Tyr 
180 185 190 

Pro Lys Ala Gly Gln Val Asn Pro Thr Ile Lys Leu Tyr Val Val Asn 
195 200 205 

Leu Tyr Gly Pro Thr His Thr Leu Glu Leu Met Pro Pro Asp Ser Phe 
210 215 220 

Lys Ser Arg Glu Tyr Tyr Ile Thr Met Val Lys Trp Val Ser Asn Thr 
225 230 235 240 

Lys Thr Val Val Arg Trp Leu Asn Arg Pro Gln Asn Ile Ser Ile Leu 
245 250 255 

Thr Val Cys Glu Thr Thr Thr Gly Ala Cys Ser Lys Lys Tyr Glu Met 
260 265 270 

Thr Ser Asp Thr Trp Leu Ser Gln Gln Asn Glu Glu Pro Val Phe Ser 
275 280 285 

Arg Asp Gly Ser Lys Phe Phe Met Thr Val Pro Val Lys Gln Gly Gly 
290 295 300 

Arg Gly Glu Phe His His Ile Ala Met Phe Leu Ile Gln Ser Lys Ser 
305 310 315 320 

Glu Gln Ile Thr Val Arg His Leu Thr Ser Gly Asn Trp Glu Val Ile 
325 330 335 

Lys Ile Leu Ala Tyr Asp Glu Thr Thr Gln Lys Ile Tyr Phe Leu Ser 
340 345 350 

Thr Glu Ser Ser Pro Arg Gly Arg Gln Leu Tyr Ser Ala Ser Thr Glu 
355 360 365 

Gly Leu Leu Asn Arg Gln Cys Ile Ser Cys Asn Phe Met Lys Glu Gln 
370 375 380 

Cys Thr Tyr Phe Asp Ala Ser Phe Ser Pro Met Asn Gln His Phe Leu 
385 390 395 400 

Leu Phe Cys Glu Gly Pro Arg Val Pro Val Val Ser Leu His Ser Thr 
405 410 415 

Asp Asn Pro Ala Lys Tyr Phe Ile Leu Glu Ser Asn Ser Met Leu Lys 
420 425 430 

Glu Ala Ile Leu Lys Lys Lys Ile Gly Lys Pro Glu Ile Lys Ile Leu 
435 440 445 

His Ile Asp Asp Tyr Glu Leu Pro Leu Gln Leu Ser Leu Pro Lys Asp 
450 455 460 

Phe Met Asp Arg Asn Gln Tyr Ala Leu Leu Leu Ile Met Asp Glu Glu 
465 470 475 480 

Pro Gly Gly Gln Leu Val Thr Asp Lys Phe His Ile Asp Trp Asp Ser 
485 490 495 

Val Leu Ile Asp Met Asp Asn Val Ile Val Ala Arg Phe Asp Gly Arg 
500 505 510 

Gly Ser Gly Phe Gln Gly Leu Lys Ile Leu Gln Glu Ile His Arg Arg 
515 520 525 

Leu Gly Ser Val Glu Val Lys Asp Gln Ile Thr Ala Val Lys Phe Leu 
530 535 540 

Leu Lys Leu Pro Tyr Ile Asp Ser Lys Arg Leu Ser Ile Phe Gly Lys 
545 550 555 560 

Gly Tyr Gly Gly Tyr Ile Ala Ser Met Ile Leu Lys Ser Asp Glu Lys 
565 570 575 

Leu Phe Lys Cys Gly Ser Val Val Ala Pro Ile Thr Asp Leu Lys Leu 
580 585 590 

Tyr Ala Ser Ala Phe Ser Glu Arg Tyr Leu Gly Met Pro Ser Lys Glu 
595 600 605 

Glu Ser Thr Tyr Gln Ala Ala Ser Val Leu His Asn Val His Gly Leu 
610 615 620 

Lys Glu Glu Asn Ile Leu Ile Ile His Gly Thr Ala Asp Thr Lys Val 
625 630 635 640 

His Phe Gln His Ser Ala Glu Leu Ile Lys His Leu Ile Lys Ala Gly 
645 650 655 

Val Asn Tyr Thr Met Gln Val Tyr Pro Asp Glu Gly His Asn Val Ser 
660 665 670 

Glu Lys Ser Lys Tyr His Leu Tyr Ser Thr Ile Leu Lys Phe Phe Ser 
675 680 685 

Asp Cys Leu Lys Glu Glu Ile Ser Val Leu Pro Gln Glu Pro Glu Glu 
690 695 700 

Asp Glu 
705 



<210> 42 

<211> 4541 

<212> DNA 

<213> Homo sapiens 

<400> 42 



gkctykgtkg wtsmagatac agatgtggtg tataaaagcg agaatggaca tgtcattaaa 60 

ctgaatatag aaacaaatgc taccacatta ttattggaaa acacaacttt tgtaaccttc 120 

aaagcatcaa gacattcagt ttcaccagat ttaaaatatg tccttctggc atatgatgtc 180 

aaacagattt ttcattattc gtatactgct tcatatgtga tttacaacat acacactagg 240 

gaagtttggg agttaaatcc tccagaagta gaggactccg tcttgcagta cgcggcctgg 300 

ggtgtccaag ggcagcagct gatttatatt tttgaaaata atatctacta tcaacctgat 360 

ataaagagca gttcattgcg actgacatct tctggaaaag aagaaataat ttttaatggg 420 

attgctgact ggttatatga agaggaactc ctgcattctc acatcgccca ctggtggtca 480 

ccagatggag aaagacttgc cttcctgatg ataaatgact ctttggtacc caccatggtt 540 

atccctcggt ttactggagc gttgtatccc aaaggaaagc agtatccgta tcctaaggca 600 

ggtcaagtga acccaacaat aaaattatat gttgtaaacc tgtatggacc aactcacact 660 

ttggagctca tgccacctga cagctttaaa tcaagagaat actatatcac tatggttaaa 720 

tgggtaagca ataccaagac tgtggtaaga tggttaaacc gacctcagaa catctccatc 780 

ctcacagtct gtgagaccac tacaggtgct tgtagtaaaa aatatgagat gacatcagat 840 

acgtggctct ctcagcagaa tgaggagccc gtgttttcta gagacggcag caaattcttt 900 

atgacagtgc ctgttaagca agggggacgt ggagaatttc accacatagc tatgttcctc 960 

atccagagta aaagtgagca aattaccgtg cggcatctga catcaggaaa ctgggaagtg 1020 

ataaagatct tggcatacga tgaaactact caaaaaattt actttctgag cactgaatct 1080 

tctcccagag gaaggcagct gtacagtgct tctactgaag gattattgaa tcgccaatgc 1140 

atttcatgta atttcatgaa agaacaatgt acatattttg atgccagttt tagtcccatg 1200 

aatcaacatt tcttattatt ctgtgaaggt ccaagggtcc cagtggtcag cctacatagt 1260 

acggacaacc cagcaaaata ttttatattg gaaagcaatt ctatgctgaa ggaagctatc 1320 

ctgaagaaga agataggaaa gccagaaatt aaaatccttc atattgacga ctatgaactt 1380 

cctttacagt tgtcccttcc caaagatttt atggaccgaa accagtatgc tcttctgtta 1440 

ataatggatg aagaaccagg aggccagctg gttacagata agttccatat tgactgggat 1500 

tccgtactca ttgacatgga taatgtcatt gtagcaagat ttgatggcag aggaagtgga 1560 

ttccagggtc tgaaaatttt gcaggagatt catcgaagat taggttcagt agaagtaaag 1620 

gaccaaataa cagctgtgaa atttttgctg aaactgcctt acattgactc caaaagatta 1680 

agcatttttg gaaagggtta tggtggctat attgcatcaa tgatcttaaa atcagatgaa 1740 

aagcttttta aatgtggatc cgtggttgca cctatcacag acttgaaatt gtatgcctca 1800 

gctttctctg aaagatacct tgggatgcca tctaaggaag aaagcactta ccaggcagcc 1860 

agtgtgctac ataatgttca tggcttgaaa gaagaaaata tattaataat tcatggaact 1920 

gctgacacaa aagttcattt ccaacactca gcagaattaa tcaagcacct aataaaagct 1980 

ggagtgaatt atactatgca ggtctaccca gatgaaggtc ataacgtatc tgagaagagc 2040 
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aagtatcatc tctacagcac aatcctcaaa ttcttcagtg attgtttgaa ggaagaaata 2100 

tctgtgctac cacaggaacc agaagaagat gaataatgga ccgtatttat acagaactga 2160 

agggaatatt gaggctcaat gaaacctgac aaagagactg taatattgta gttgctccag 2220 

aatgtcaagg gcagcttacg gagatgtcac tggagcagca cgctcagaga cagtgaacta 2280 

gcatttgaat acacaagtcc aagtctactg tgttgctagg ggtgcagaac ccgtttcttt 2340 

gtatgagaga ggtcaaaggg ttggtttcct gggagaaatt agttttgcat taaagtagga 2400 

gtagtgcatg ttttcttctg ttatccccct gtttgttctg taactagttg ctctcatttt 2460 

aatttcactg gccaccatca tctttgcata taatgcacaa tctatcatct gtcctacagt 2520 

ccctgatctt tcatggctga gctgcaatct aacactttac tgtaccttta taataagtgc 2580 

aattctttca ttgtctatta ttatgcttaa gaaaatattc agttaataaa aaacagagta 2640 

ttttatgtaa tttctgtttt taaaaagaca ttattaaatg ggtcaaagga catatagaaa 2700 

tgtggatttc agcaccttcc aaagttcagc cagttatcag tagatacaat atctttaaat 2760 

gaacacacga gtgtatgtct cacaatatat atacacaagt gtgcatatac agttaatgaa 2820 

actatcttta aatgttattc atgctataaa gagtaaacgt ttgatgaatt agaagagatg 2880 

ctcttttcca agctataatg gatgctttgt ttaatgagcc aaatatgatg aaacattttt 2940 

tccaattcaa attctagcta ttgctttcct ataaatgttt gggttgtgtt tggtattgtt 3000 

tttagtggtt aatagttttc cagttgcatt taattttttg aatatgatac cttgtcacat 3060 

gtaaattaga tacttaaata ttaaattata gtttctgata aagaaatttt gttaacaatg 3120 

caatgccact gagtgctatt ttgctctttt ggtggagaag gcttttttca aaactcttgg 3180 

tccttttact tctttctctc agtgcagaat caattctcat tttcatcgta aaagcaaata 3240 

gctggattat ttcatttgcc agtttctatt tagtattcca tgcctgccca attcatctgt 3300 

tactgtttaa tttcaattct tctggtgaga attagaaatg aaatattttt tattcattgg 3360 

ccaaaaagtt cacagacagc agtgtttgct atttactttg aattgaaggc acaaaatgca 3420 

tcaattcctg tgctgtgttg acttgcagta gtaagtaact gagagcataa aataaacctg 3480 

actgtatgaa gtcaatttaa gtgatgagaa catttaactt tggtgactaa agtcagaata 3540 

tcttctcact tcacttaagg gatcttccag aagatatcta aaagtctgta ataagcttag 3600 

aagttcagat aaatctaggc aggatactgc atttttgtgg ttttaaaaaa gtccttagga 3660 

cagactgaat tatcataact tatggcatca ggaggaaact ttaaaatatc aaggaatcac 3720 

tcagtcaccc tcctgttttg ttgaaggatc aaccccaaat tctgggtatt tgagtacatg 3780 

tgaatcatgg atttggtatt caactttttc cctggatgct ttggaatcgt gtcttccatg 3840 

ctccactggg ttcaatttaa aataggagag gctttctctt ctgaaagatc cattttaggt 3900 

ctttttcaag aatagtgaac acatttttta acaaaataag ttgtaatttt aaaaggaaag 3960 

ttttgcctat tttattaaga tggaaatttc tttttaggct aatttgaaat ccaactgaag 4020 

ctttttaacc aatattttaa atttgaacca ctagagtttt ttatgatgca aatgattatg 4080 

ttgtctgaaa ggtgtggttt tattgaatgt ctatttgagt atcatttaaa aagtatttgc 4140 

cttttactgt catcatttct cttgttttat tattattatc aatgtttatc tatttttcaa 4200 

ttaatttaat acagtttcta atgtgaaaga catttttctg gaacccgttt tccccttaaa 4260 

cactaaagag acctcaagtg aaagcatatt gcttagtagg aaggtagaaa atgttaatcc 4320 

ctgcgattct ttgagtttta atgacagggt cattttcagt aaaggaaatg ctcaccaaca 4380 

catagtcacc aactattaaa ggaatcatgt gattggattt tcccctgtat acatgtaccc 4440 

ttggtcataa tcccactatt tcatacatat ttatgcattg ctagattttc ctaggactcc 4500 

aatagcatgc tttccaagtg ttattattcc cttaatgtta a 4541 



<210> 43 

<211> 691 

<212> PRT 

<213> Homo sapiens 

<400> 43 



Asp Thr Asp Val Val Tyr Lys Ser Glu Asn Gly His Val Ile Lys Leu 
1 5 10 15 

Asn Ile Glu Thr Asn Ala Thr Thr Leu Leu Leu Glu Asn Thr Thr Phe 
20 25 30 

Val Thr Phe Lys Ala Ser Arg His Ser Val Ser Pro Asp Leu Lys Tyr 
35 40 45 

Val Leu Leu Ala Tyr Asp Val Lys Gln Ile Phe His Tyr Ser Tyr Thr 
50 55 60 

Ala Ser Tyr Val Ile Tyr Asn Ile His Thr Arg Glu Val Trp Glu Leu 
65 70 75 80 

Asn Pro Pro Glu Val Glu Asp Ser Val Leu Gln Tyr Ala Ala Trp Gly 
85 90 95 

Val Gln Gly Gln Gln Leu Ile Tyr Ile Phe Glu Asn Asn Ile Tyr Tyr 
100 105 110 

Gln Pro Asp Ile Lys Ser Ser Ser Leu Arg Leu Thr Ser Ser Gly Lys 
115 120 125 

Glu Glu Ile Ile Phe Asn Gly Ile Ala Asp Trp Leu Tyr Glu Glu Glu 
130 135 140 

Leu Leu His Ser His Ile Ala His Trp Trp Ser Pro Asp Gly Glu Arg 
145 150 155 160 

Leu Ala Phe Leu Met Ile Asn Asp Ser Leu Val Pro Thr Met Val Ile 
165 170 175 

Pro Arg Phe Thr Gly Ala Leu Tyr Pro Lys Gly Lys Gln Tyr Pro Tyr 
180 185 190 

Pro Lys Ala Gly Gln Val Asn Pro Thr Ile Lys Leu Tyr Val Val Asn 
195 200 205 

Leu Tyr Gly Pro Thr His Thr Leu Glu Leu Met Pro Pro Asp Ser Phe 
210 215 220 

Lys Ser Arg Glu Tyr Tyr Ile Thr Met Val Lys Trp Val Ser Asn Thr 
225 230 235 240 

Lys Thr Val Val Arg Trp Leu Asn Arg Pro Gln Asn Ile Ser Ile Leu 
245 250 255 

Thr Val Cys Glu Thr Thr Thr Gly Ala Cys Ser Lys Lys Tyr Glu Met 
260 265 270 

Thr Ser Asp Thr Trp Leu Ser Gln Gln Asn Glu Glu Pro Val Phe Ser 
275 280 285 

Arg Asp Gly Ser Lys Phe Phe Met Thr Val Pro Val Lys Gln Gly Gly 
290 295 300 

Arg Gly Glu Phe His His Ile Ala Met Phe Leu Ile Gln Ser Lys Ser 
305 310 315 320 

Glu Gln Ile Thr Val Arg His Leu Thr Ser Gly Asn Trp Glu Val Ile 
325 330 335 

Lys Ile Leu Ala Tyr Asp Glu Thr Thr Gln Lys Ile Ser Ala Ser Thr 
340 345 350 

Glu Gly Leu Leu Asn Arg Gln Cys Ile Ser Cys Asn Phe Met Lys Glu 
355 360 365 

Gln Cys Thr Tyr Phe Asp Ala Ser Phe Ser Pro Met Asn Gln His Phe 
370 375 380 

Leu Leu Phe Cys Glu Gly Pro Arg Val Pro Val Val Ser Leu His Ser 
385 390 395 400 

Thr Asp Asn Pro Ala Lys Tyr Phe Ile Leu Glu Ser Asn Ser Met Leu 
405 410 415 

Lys Glu Ala Ile Leu Lys Lys Lys Ile Gly Lys Pro Glu Ile Lys Ile 
420 425 430 

Leu His Ile Asp Asp Tyr Glu Leu Pro Leu Gln Leu Ser Leu Pro Lys 
435 440 445 

Asp Phe Met Asp Arg Asn Gln Tyr Ala Leu Leu Leu Ile Met Asp Glu 
450 455 460 

Glu Pro Gly Gly Gln Leu Val Thr Asp Lys Phe His Ile Asp Trp Asp 
465 470 475 480 

Ser Val Leu Ile Asp Met Asp Asn Val Ile Val Ala Arg Phe Asp Gly 
485 490 495 

Arg Gly Ser Gly Phe Gln Gly Leu Lys Ile Leu Gln Glu Ile His Arg 
500 505 510 

Arg Leu Gly Ser Val Glu Val Lys Asp Gln Ile Thr Ala Val Lys Phe 
515 520 525 

Leu Leu Lys Leu Pro Tyr Ile Asp Ser Lys Arg Leu Ser Ile Phe Gly 
530 535 540 

Lys Gly Tyr Gly Gly Tyr Ile Ala Ser Met Ile Leu Lys Ser Asp Glu 
545 550 555 560 

Lys Leu Phe Lys Cys Gly Ser Val Val Ala Pro Ile Thr Asp Leu Lys 
565 570 575 

Leu Tyr Ala Ser Ala Phe Ser Glu Arg Tyr Leu Gly Met Pro Ser Lys 
580 585 590 

Glu Glu Ser Thr Tyr Gln Ala Ala Ser Val Leu His Asn Val His Gly 
595 600 605 

Leu Lys Glu Glu Asn Ile Leu Ile Ile His Gly Thr Ala Asp Thr Lys 
610 615 620 

Val His Phe Gln His Ser Ala Glu Leu Ile Lys His Leu Ile Lys Ala 
625 630 635 640 

Gly Val Asn Tyr Thr Met Gln Val Tyr Pro Asp Glu Gly His Asn Val 
645 650 655 

Ser Glu Lys Ser Lys Tyr His Leu Tyr Ser Thr Ile Leu Lys Phe Phe 
660 665 670 

Ser Asp Cys Leu Lys Glu Glu Ile Ser Val Leu Pro Gln Glu Pro Glu 
675 680 685 

Glu Asp Glu 
690 



<210> 44 

<211> 4496 

<212> DNA 

<213> Homo sapiens 

<400> 44 

gkctykgtkg wtsmagatac agatgtggtg tataaaagcg agaatggaca tgtcattaaa 60 

ctgaatatag aaacaaatgc taccacatta ttattggaaa acacaacttt tgtaaccttc 120 

aaagcatcaa gacattcagt ttcaccagat ttaaaatatg tccttctggc atatgatgtc 180 

aaacagattt ttcattattc gtatactgct tcatatgtga tttacaacat acacactagg 240 

gaagtttggg agttaaatcc tccagaagta gaggactccg tcttgcagta cgcggcctgg 300 

ggtgtccaag ggcagcagct gatttatatt tttgaaaata atatctacta tcaacctgat 360 

ataaagagca gttcattgcg actgacatct tctggaaaag aagaaataat ttttaatggg 420 

attgctgact ggttatatga agaggaactc ctgcattctc acatcgccca ctggtggtca 480 

ccagatggag aaagacttgc cttcctgatg ataaatgact ctttggtacc caccatggtt 540 

atccctcggt ttactggagc gttgtatccc aaaggaaagc agtatccgta tcctaaggca 600 

ggtcaagtga acccaacaat aaaattatat gttgtaaacc tgtatggacc aactcacact 660 

ttggagctca tgccacctga cagctttaaa tcaagagaat actatatcac tatggttaaa 720 

tgggtaagca ataccaagac tgtggtaaga tggttaaacc gacctcagaa catctccatc 780 

ctcacagtct gtgagaccac tacaggtgct tgtagtaaaa aatatgagat gacatcagat 840 

acgtggctct ctcagcagaa tgaggagccc gtgttttcta gagacggcag caaattcttt 900 

atgacagtgc ctgttaagca agggggacgt ggagaatttc accacatagc tatgttcctc 960 

atccagagta aaagtgagca aattaccgtg cggcatctga catcaggaaa ctgggaagtg 1020 

ataaagatct tggcatacga tgaaactact caaaaaatca gtgcttctac tgaaggatta 1080 

ttgaatcgcc aatgcatttc atgtaatttc atgaaagaac aatgtacata ttttgatgcc 1140 

agttttagtc ccatgaatca acatttctta ttattctgtg aaggtccaag ggtcccagtg 1200 

gtcagcctac atagtacgga caacccagca aaatatttta tattggaaag caattctatg 1260 

ctgaaggaag ctatcctgaa gaagaagata ggaaagccag aaattaaaat ccttcatatt 1320 

gacgactatg aacttccttt acagttgtcc cttcccaaag attttatgga ccgaaaccag 1380 

tatgctcttc tgttaataat ggatgaagaa ccaggaggcc agctggttac agataagttc 1440 

catattgact gggattccgt actcattgac atggataatg tcattgtagc aagatttgat 1500 

ggcagaggaa gtggattcca gggtctgaaa attttgcagg agattcatcg aagattaggt 1560 

tcagtagaag taaaggacca aataacagct gtgaaatttt tgctgaaact gccttacatt 1620 

gactccaaaa gattaagcat ttttggaaag ggttatggtg gctatattgc atcaatgatc 1680 

ttaaaatcag atgaaaagct ttttaaatgt ggatccgtgg ttgcacctat cacagacttg 1740 

aaattgtatg cctcagcttt ctctgaaaga taccttggga tgccatctaa ggaagaaagc 1800 

acttaccagg cagccagtgt gctacataat gttcatggct tgaaagaaga aaatatatta 1860 

ataattcatg gaactgctga cacaaaagtt catttccaac actcagcaga attaatcaag 1920 
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cacctaataa aagctggagt gaattatact atgcaggtct acccagatga aggtcataac 1980 

gtatctgaga agagcaagta tcatctctac agcacaatcc tcaaattctt cagtgattgt 2040 

ttgaaggaag aaatatctgt gctaccacag gaaccagaag aagatgaata atggaccgta 2100 

tttatacaga actgaaggga atattgaggc tcaatgaaac ctgacaaaga gactgtaata 2160 

ttgtagttgc tccagaatgt caagggcagc ttacggagat gtcactggag cagcacgctc 2220 

agagacagtg aactagcatt tgaatacaca agtccaagtc tactgtgttg ctaggggtgc 2280 

agaacccgtt tctttgtatg agagaggtca aagggttggt ttcctgggag aaattagttt 2340 

tgcattaaag taggagtagt gcatgttttc ttctgttatc cccctgtttg ttctgtaact 2400 

agttgctctc attttaattt cactggccac catcatcttt gcatataatg cacaatctat 2460 

catctgtcct acagtccctg atctttcatg gctgagctgc aatctaacac tttactgtac 2520 

ctttataata agtgcaattc tttcattgtc tattattatg cttaagaaaa tattcagtta 2580 

ataaaaaaca gagtatttta tgtaatttct gtttttaaaa agacattatt aaatgggtca 2640 

aaggacatat agaaatgtgg atttcagcac cttccaaagt tcagccagtt atcagtagat 2700 

acaatatctt taaatgaaca cacgagtgta tgtctcacaa tatatataca caagtgtgca 2760 

tatacagtta atgaaactat ctttaaatgt tattcatgct ataaagagta aacgtttgat 2820 

gaattagaag agatgctctt ttccaagcta taatggatgc tttgtttaat gagccaaata 2880 

tgatgaaaca ttttttccaa ttcaaattct agctattgct ttcctataaa tgtttgggtt 2940 

gtgtttggta ttgtttttag tggttaatag ttttccagtt gcatttaatt ttttgaatat 3000 

gataccttgt cacatgtaaa ttagatactt aaatattaaa ttatagtttc tgataaagaa 3060 

attttgttaa caatgcaatg ccactgagtg ctattttgct cttttggtgg agaaggcttt 3120 

tttcaaaact cttggtcctt ttacttcttt ctctcagtgc agaatcaatt ctcattttca 3180 

tcgtaaaagc aaatagctgg attatttcat ttgccagttt ctatttagta ttccatgcct 3240 

gcccaattca tctgttactg tttaatttca attcttctgg tgagaattag aaatgaaata 3300 

ttttttattc attggccaaa aagttcacag acagcagtgt ttgctattta ctttgaattg 3360 

aaggcacaaa atgcatcaat tcctgtgctg tgttgacttg cagtagtaag taactgagag 3420 

cataaaataa acctgactgt atgaagtcaa tttaagtgat gagaacattt aactttggtg 3480 

actaaagtca gaatatcttc tcacttcact taagggatct tccagaagat atctaaaagt 3540 

ctgtaataag cttagaagtt cagataaatc taggcaggat actgcatttt tgtggtttta 3600 

aaaaagtcct taggacagac tgaattatca taacttatgg catcaggagg aaactttaaa 3660 

atatcaagga atcactcagt caccctcctg ttttgttgaa ggatcaaccc caaattctgg 3720 

gtatttgagt acatgtgaat catggatttg gtattcaact ttttccctgg atgctttgga 3780 

atcgtgtctt ccatgctcca ctgggttcaa tttaaaatag gagaggcttt ctcttctgaa 3840 

agatccattt taggtctttt tcaagaatag tgaacacatt ttttaacaaa ataagttgta 3900 

attttaaaag gaaagttttg cctattttat taagatggaa atttcttttt aggctaattt 3960 

gaaatccaac tgaagctttt taaccaatat tttaaatttg aaccactaga gttttttatg 4020 

atgcaaatga ttatgttgtc tgaaaggtgt ggttttattg aatgtctatt tgagtatcat 4080 

ttaaaaagta tttgcctttt actgtcatca tttctcttgt tttattatta ttatcaatgt 4140 

ttatctattt ttcaattaat ttaatacagt ttctaatgtg aaagacattt ttctggaacc 4200 

cgttttcccc ttaaacacta aagagacctc aagtgaaagc atattgctta gtaggaaggt 4260 

agaaaatgtt aatccctgcg attctttgag ttttaatgac agggtcattt tcagtaaagg 4320 

aaatgctcac caacacatag tcaccaacta ttaaaggaat catgtgattg gattttcccc 4380 

tgtatacatg tacccttggt cataatccca ctatttcata catatttatg cattgctaga 4440 

ttttcctagg actccaatag catgctttcc aagtgttatt attcccttaa tgttaa 4496 

<210> 45 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 45 

cggtaccatg gcagcagcaa tggaaacag 29 

<210> 46 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<400> 46 

ggagctcgcg gccgctcata tcacttttag agcagcaat 39 
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<210> 47 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 47 

caagctttat cacttttaga gcagcaa 27 

<210> 48 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 48 

cacattcttg ctgcatcagt ca 22 

<210> 49 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 49 

ttgggtcatc ttcaggactt ga 22 

<210> 50 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 50 

caagcttacc atggccacca ccgggac 27 

<210> 51 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<400> 51 

cggatccgcg gccgctcaga ggtattcctg tagaaag 37 

<210> 52 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cggatccagg tattcctgta gaaagtg 27 

<210> 53 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 53 

tacgccgtgg ttgtgattga 

<210> 54 
<211> 20 
<212> DNA 
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<213> Homo sapiens 
<400> 54 

ccatacttct cggccacgaa 

<210> 55 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 55 

gcctgggatt gtgcactgt 

<210> 56 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 56 

gtgtattcaa atgctagttc actgtctct 

<210> 57 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 57 

agctagcact gtccagggtc ct 

<210> 58 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 58 

agggcccttc atcttcttct ggttc 

<210> 59 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 59 

Val Glu Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro 
1 5 10 15 

Asp Ser Val 

<210> 60 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val Pro 
1 5 10 15 

Arg Asn Val 



<210> 61 
<211> 15 
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<212> PRT 

<213> Homo sapiens 

<400> 61 

Lys Glu Ala Ile Leu Lys Lys Lys Ile Gly Lys Pro Glu Ile Lys 
1 5 10 15 
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